Amino acid dipepetide frequency for Hubei picorna-like virus 52

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.971AlaAla: 3.971 ± 0.0
1.059AlaCys: 1.059 ± 0.0
3.442AlaAsp: 3.442 ± 0.0
2.912AlaGlu: 2.912 ± 0.0
2.383AlaPhe: 2.383 ± 0.0
3.177AlaGly: 3.177 ± 0.0
0.265AlaHis: 0.265 ± 0.0
3.971AlaIle: 3.971 ± 0.0
3.177AlaLys: 3.177 ± 0.0
3.971AlaLeu: 3.971 ± 0.0
0.794AlaMet: 0.794 ± 0.0
2.648AlaAsn: 2.648 ± 0.0
2.118AlaPro: 2.118 ± 0.0
2.383AlaGln: 2.383 ± 0.0
2.912AlaArg: 2.912 ± 0.0
3.707AlaSer: 3.707 ± 0.0
2.912AlaThr: 2.912 ± 0.0
3.177AlaVal: 3.177 ± 0.0
0.53AlaTrp: 0.53 ± 0.0
1.589AlaTyr: 1.589 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.53CysAla: 0.53 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.589CysAsp: 1.589 ± 0.0
1.589CysGlu: 1.589 ± 0.0
1.059CysPhe: 1.059 ± 0.0
0.794CysGly: 0.794 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.53CysIle: 0.53 ± 0.0
1.853CysLys: 1.853 ± 0.0
1.324CysLeu: 1.324 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.265CysAsn: 0.265 ± 0.0
0.265CysPro: 0.265 ± 0.0
0.265CysGln: 0.265 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.794CysSer: 0.794 ± 0.0
0.265CysThr: 0.265 ± 0.0
0.794CysVal: 0.794 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.53CysTyr: 0.53 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.177AspAla: 3.177 ± 0.0
1.059AspCys: 1.059 ± 0.0
4.236AspAsp: 4.236 ± 0.0
4.236AspGlu: 4.236 ± 0.0
2.912AspPhe: 2.912 ± 0.0
2.118AspGly: 2.118 ± 0.0
0.794AspHis: 0.794 ± 0.0
3.707AspIle: 3.707 ± 0.0
4.236AspLys: 4.236 ± 0.0
5.825AspLeu: 5.825 ± 0.0
1.059AspMet: 1.059 ± 0.0
3.177AspAsn: 3.177 ± 0.0
2.648AspPro: 2.648 ± 0.0
2.912AspGln: 2.912 ± 0.0
1.853AspArg: 1.853 ± 0.0
3.177AspSer: 3.177 ± 0.0
4.236AspThr: 4.236 ± 0.0
3.971AspVal: 3.971 ± 0.0
0.53AspTrp: 0.53 ± 0.0
2.648AspTyr: 2.648 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.853GluAla: 1.853 ± 0.0
1.059GluCys: 1.059 ± 0.0
3.177GluAsp: 3.177 ± 0.0
3.971GluGlu: 3.971 ± 0.0
3.442GluPhe: 3.442 ± 0.0
1.589GluGly: 1.589 ± 0.0
1.324GluHis: 1.324 ± 0.0
6.354GluIle: 6.354 ± 0.0
3.707GluLys: 3.707 ± 0.0
5.295GluLeu: 5.295 ± 0.0
1.324GluMet: 1.324 ± 0.0
5.825GluAsn: 5.825 ± 0.0
3.177GluPro: 3.177 ± 0.0
2.383GluGln: 2.383 ± 0.0
3.177GluArg: 3.177 ± 0.0
2.648GluSer: 2.648 ± 0.0
4.236GluThr: 4.236 ± 0.0
4.236GluVal: 4.236 ± 0.0
0.265GluTrp: 0.265 ± 0.0
3.177GluTyr: 3.177 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.912PheAla: 2.912 ± 0.0
0.794PheCys: 0.794 ± 0.0
3.971PheAsp: 3.971 ± 0.0
3.971PheGlu: 3.971 ± 0.0
1.324PhePhe: 1.324 ± 0.0
2.383PheGly: 2.383 ± 0.0
1.059PheHis: 1.059 ± 0.0
2.383PheIle: 2.383 ± 0.0
4.766PheLys: 4.766 ± 0.0
2.912PheLeu: 2.912 ± 0.0
0.794PheMet: 0.794 ± 0.0
4.501PheAsn: 4.501 ± 0.0
1.589PhePro: 1.589 ± 0.0
2.648PheGln: 2.648 ± 0.0
1.324PheArg: 1.324 ± 0.0
2.912PheSer: 2.912 ± 0.0
5.295PheThr: 5.295 ± 0.0
4.236PheVal: 4.236 ± 0.0
0.53PheTrp: 0.53 ± 0.0
1.589PheTyr: 1.589 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.295GlyAla: 5.295 ± 0.0
0.265GlyCys: 0.265 ± 0.0
3.442GlyAsp: 3.442 ± 0.0
2.383GlyGlu: 2.383 ± 0.0
3.707GlyPhe: 3.707 ± 0.0
1.589GlyGly: 1.589 ± 0.0
0.794GlyHis: 0.794 ± 0.0
1.853GlyIle: 1.853 ± 0.0
3.971GlyLys: 3.971 ± 0.0
3.707GlyLeu: 3.707 ± 0.0
1.853GlyMet: 1.853 ± 0.0
2.383GlyAsn: 2.383 ± 0.0
1.059GlyPro: 1.059 ± 0.0
1.324GlyGln: 1.324 ± 0.0
0.53GlyArg: 0.53 ± 0.0
2.383GlySer: 2.383 ± 0.0
3.177GlyThr: 3.177 ± 0.0
2.118GlyVal: 2.118 ± 0.0
0.265GlyTrp: 0.265 ± 0.0
1.589GlyTyr: 1.589 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.265HisAla: 0.265 ± 0.0
0.265HisCys: 0.265 ± 0.0
0.794HisAsp: 0.794 ± 0.0
1.589HisGlu: 1.589 ± 0.0
1.853HisPhe: 1.853 ± 0.0
1.059HisGly: 1.059 ± 0.0
0.265HisHis: 0.265 ± 0.0
1.589HisIle: 1.589 ± 0.0
0.794HisLys: 0.794 ± 0.0
2.118HisLeu: 2.118 ± 0.0
0.265HisMet: 0.265 ± 0.0
1.324HisAsn: 1.324 ± 0.0
0.0HisPro: 0.0 ± 0.0
1.324HisGln: 1.324 ± 0.0
0.794HisArg: 0.794 ± 0.0
1.059HisSer: 1.059 ± 0.0
1.853HisThr: 1.853 ± 0.0
0.265HisVal: 0.265 ± 0.0
0.53HisTrp: 0.53 ± 0.0
2.118HisTyr: 2.118 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.648IleAla: 2.648 ± 0.0
0.794IleCys: 0.794 ± 0.0
5.295IleAsp: 5.295 ± 0.0
4.501IleGlu: 4.501 ± 0.0
2.648IlePhe: 2.648 ± 0.0
2.118IleGly: 2.118 ± 0.0
1.059IleHis: 1.059 ± 0.0
6.354IleIle: 6.354 ± 0.0
7.149IleLys: 7.149 ± 0.0
6.619IleLeu: 6.619 ± 0.0
1.059IleMet: 1.059 ± 0.0
7.149IleAsn: 7.149 ± 0.0
4.501IlePro: 4.501 ± 0.0
4.766IleGln: 4.766 ± 0.0
1.589IleArg: 1.589 ± 0.0
6.354IleSer: 6.354 ± 0.0
4.501IleThr: 4.501 ± 0.0
3.971IleVal: 3.971 ± 0.0
0.53IleTrp: 0.53 ± 0.0
2.912IleTyr: 2.912 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.442LysAla: 3.442 ± 0.0
0.794LysCys: 0.794 ± 0.0
4.501LysAsp: 4.501 ± 0.0
5.295LysGlu: 5.295 ± 0.0
4.236LysPhe: 4.236 ± 0.0
2.648LysGly: 2.648 ± 0.0
2.118LysHis: 2.118 ± 0.0
4.501LysIle: 4.501 ± 0.0
4.766LysLys: 4.766 ± 0.0
7.149LysLeu: 7.149 ± 0.0
2.383LysMet: 2.383 ± 0.0
4.766LysAsn: 4.766 ± 0.0
1.853LysPro: 1.853 ± 0.0
3.442LysGln: 3.442 ± 0.0
2.912LysArg: 2.912 ± 0.0
3.971LysSer: 3.971 ± 0.0
4.501LysThr: 4.501 ± 0.0
5.56LysVal: 5.56 ± 0.0
0.0LysTrp: 0.0 ± 0.0
3.707LysTyr: 3.707 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.501LeuAla: 4.501 ± 0.0
1.059LeuCys: 1.059 ± 0.0
4.236LeuAsp: 4.236 ± 0.0
5.03LeuGlu: 5.03 ± 0.0
4.501LeuPhe: 4.501 ± 0.0
4.236LeuGly: 4.236 ± 0.0
2.648LeuHis: 2.648 ± 0.0
6.884LeuIle: 6.884 ± 0.0
6.884LeuLys: 6.884 ± 0.0
5.825LeuLeu: 5.825 ± 0.0
2.118LeuMet: 2.118 ± 0.0
9.531LeuAsn: 9.531 ± 0.0
3.177LeuPro: 3.177 ± 0.0
3.971LeuGln: 3.971 ± 0.0
3.442LeuArg: 3.442 ± 0.0
5.03LeuSer: 5.03 ± 0.0
5.03LeuThr: 5.03 ± 0.0
6.619LeuVal: 6.619 ± 0.0
0.265LeuTrp: 0.265 ± 0.0
3.442LeuTyr: 3.442 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.853MetAla: 1.853 ± 0.0
0.265MetCys: 0.265 ± 0.0
0.53MetAsp: 0.53 ± 0.0
1.853MetGlu: 1.853 ± 0.0
1.853MetPhe: 1.853 ± 0.0
0.53MetGly: 0.53 ± 0.0
0.794MetHis: 0.794 ± 0.0
1.324MetIle: 1.324 ± 0.0
0.53MetLys: 0.53 ± 0.0
2.648MetLeu: 2.648 ± 0.0
0.265MetMet: 0.265 ± 0.0
0.265MetAsn: 0.265 ± 0.0
1.059MetPro: 1.059 ± 0.0
3.177MetGln: 3.177 ± 0.0
1.059MetArg: 1.059 ± 0.0
1.324MetSer: 1.324 ± 0.0
0.794MetThr: 0.794 ± 0.0
1.059MetVal: 1.059 ± 0.0
0.0MetTrp: 0.0 ± 0.0
2.383MetTyr: 2.383 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.971AsnAla: 3.971 ± 0.0
0.794AsnCys: 0.794 ± 0.0
1.853AsnAsp: 1.853 ± 0.0
2.912AsnGlu: 2.912 ± 0.0
4.766AsnPhe: 4.766 ± 0.0
3.707AsnGly: 3.707 ± 0.0
0.53AsnHis: 0.53 ± 0.0
5.825AsnIle: 5.825 ± 0.0
3.971AsnLys: 3.971 ± 0.0
9.531AsnLeu: 9.531 ± 0.0
2.383AsnMet: 2.383 ± 0.0
4.766AsnAsn: 4.766 ± 0.0
2.648AsnPro: 2.648 ± 0.0
1.324AsnGln: 1.324 ± 0.0
1.853AsnArg: 1.853 ± 0.0
5.56AsnSer: 5.56 ± 0.0
5.295AsnThr: 5.295 ± 0.0
3.442AsnVal: 3.442 ± 0.0
0.265AsnTrp: 0.265 ± 0.0
4.766AsnTyr: 4.766 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.118ProAla: 2.118 ± 0.0
0.265ProCys: 0.265 ± 0.0
2.118ProAsp: 2.118 ± 0.0
1.059ProGlu: 1.059 ± 0.0
1.853ProPhe: 1.853 ± 0.0
1.589ProGly: 1.589 ± 0.0
0.265ProHis: 0.265 ± 0.0
2.118ProIle: 2.118 ± 0.0
2.912ProLys: 2.912 ± 0.0
3.177ProLeu: 3.177 ± 0.0
0.53ProMet: 0.53 ± 0.0
2.648ProAsn: 2.648 ± 0.0
1.324ProPro: 1.324 ± 0.0
0.265ProGln: 0.265 ± 0.0
2.383ProArg: 2.383 ± 0.0
3.177ProSer: 3.177 ± 0.0
3.177ProThr: 3.177 ± 0.0
3.442ProVal: 3.442 ± 0.0
0.265ProTrp: 0.265 ± 0.0
0.794ProTyr: 0.794 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.853GlnAla: 1.853 ± 0.0
0.0GlnCys: 0.0 ± 0.0
3.177GlnAsp: 3.177 ± 0.0
3.707GlnGlu: 3.707 ± 0.0
1.589GlnPhe: 1.589 ± 0.0
2.383GlnGly: 2.383 ± 0.0
1.589GlnHis: 1.589 ± 0.0
5.825GlnIle: 5.825 ± 0.0
2.118GlnLys: 2.118 ± 0.0
5.56GlnLeu: 5.56 ± 0.0
2.118GlnMet: 2.118 ± 0.0
3.707GlnAsn: 3.707 ± 0.0
1.059GlnPro: 1.059 ± 0.0
1.853GlnGln: 1.853 ± 0.0
2.912GlnArg: 2.912 ± 0.0
2.383GlnSer: 2.383 ± 0.0
3.707GlnThr: 3.707 ± 0.0
1.589GlnVal: 1.589 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
0.794GlnTyr: 0.794 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.118ArgAla: 2.118 ± 0.0
0.794ArgCys: 0.794 ± 0.0
2.383ArgAsp: 2.383 ± 0.0
1.853ArgGlu: 1.853 ± 0.0
1.059ArgPhe: 1.059 ± 0.0
1.324ArgGly: 1.324 ± 0.0
1.059ArgHis: 1.059 ± 0.0
5.295ArgIle: 5.295 ± 0.0
1.589ArgLys: 1.589 ± 0.0
3.442ArgLeu: 3.442 ± 0.0
0.53ArgMet: 0.53 ± 0.0
2.912ArgAsn: 2.912 ± 0.0
0.265ArgPro: 0.265 ± 0.0
2.912ArgGln: 2.912 ± 0.0
2.912ArgArg: 2.912 ± 0.0
2.383ArgSer: 2.383 ± 0.0
3.971ArgThr: 3.971 ± 0.0
2.912ArgVal: 2.912 ± 0.0
0.53ArgTrp: 0.53 ± 0.0
1.059ArgTyr: 1.059 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.707SerAla: 3.707 ± 0.0
0.794SerCys: 0.794 ± 0.0
1.853SerAsp: 1.853 ± 0.0
3.442SerGlu: 3.442 ± 0.0
3.971SerPhe: 3.971 ± 0.0
2.648SerGly: 2.648 ± 0.0
1.059SerHis: 1.059 ± 0.0
6.089SerIle: 6.089 ± 0.0
5.295SerLys: 5.295 ± 0.0
5.825SerLeu: 5.825 ± 0.0
0.794SerMet: 0.794 ± 0.0
2.912SerAsn: 2.912 ± 0.0
3.177SerPro: 3.177 ± 0.0
4.236SerGln: 4.236 ± 0.0
2.118SerArg: 2.118 ± 0.0
4.501SerSer: 4.501 ± 0.0
3.442SerThr: 3.442 ± 0.0
5.295SerVal: 5.295 ± 0.0
0.265SerTrp: 0.265 ± 0.0
2.648SerTyr: 2.648 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.118ThrAla: 2.118 ± 0.0
0.53ThrCys: 0.53 ± 0.0
5.03ThrAsp: 5.03 ± 0.0
3.442ThrGlu: 3.442 ± 0.0
4.236ThrPhe: 4.236 ± 0.0
2.912ThrGly: 2.912 ± 0.0
1.853ThrHis: 1.853 ± 0.0
3.707ThrIle: 3.707 ± 0.0
5.825ThrLys: 5.825 ± 0.0
6.089ThrLeu: 6.089 ± 0.0
2.383ThrMet: 2.383 ± 0.0
3.707ThrAsn: 3.707 ± 0.0
3.442ThrPro: 3.442 ± 0.0
4.236ThrGln: 4.236 ± 0.0
2.912ThrArg: 2.912 ± 0.0
5.56ThrSer: 5.56 ± 0.0
6.089ThrThr: 6.089 ± 0.0
3.971ThrVal: 3.971 ± 0.0
0.53ThrTrp: 0.53 ± 0.0
2.383ThrTyr: 2.383 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.118ValAla: 2.118 ± 0.0
0.794ValCys: 0.794 ± 0.0
3.971ValAsp: 3.971 ± 0.0
3.971ValGlu: 3.971 ± 0.0
2.118ValPhe: 2.118 ± 0.0
3.442ValGly: 3.442 ± 0.0
0.794ValHis: 0.794 ± 0.0
5.295ValIle: 5.295 ± 0.0
6.619ValLys: 6.619 ± 0.0
3.971ValLeu: 3.971 ± 0.0
1.324ValMet: 1.324 ± 0.0
5.295ValAsn: 5.295 ± 0.0
0.794ValPro: 0.794 ± 0.0
3.177ValGln: 3.177 ± 0.0
3.442ValArg: 3.442 ± 0.0
4.236ValSer: 4.236 ± 0.0
5.03ValThr: 5.03 ± 0.0
2.118ValVal: 2.118 ± 0.0
0.794ValTrp: 0.794 ± 0.0
2.118ValTyr: 2.118 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.794TrpAla: 0.794 ± 0.0
0.265TrpCys: 0.265 ± 0.0
0.265TrpAsp: 0.265 ± 0.0
0.794TrpGlu: 0.794 ± 0.0
0.53TrpPhe: 0.53 ± 0.0
0.794TrpGly: 0.794 ± 0.0
0.265TrpHis: 0.265 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.53TrpLys: 0.53 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.265TrpPro: 0.265 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.53TrpSer: 0.53 ± 0.0
1.059TrpThr: 1.059 ± 0.0
0.265TrpVal: 0.265 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.265TrpTyr: 0.265 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.853TyrAla: 1.853 ± 0.0
0.794TyrCys: 0.794 ± 0.0
2.648TyrAsp: 2.648 ± 0.0
3.707TyrGlu: 3.707 ± 0.0
1.853TyrPhe: 1.853 ± 0.0
2.648TyrGly: 2.648 ± 0.0
1.589TyrHis: 1.589 ± 0.0
3.177TyrIle: 3.177 ± 0.0
1.853TyrLys: 1.853 ± 0.0
3.442TyrLeu: 3.442 ± 0.0
1.324TyrMet: 1.324 ± 0.0
2.383TyrAsn: 2.383 ± 0.0
1.059TyrPro: 1.059 ± 0.0
1.324TyrGln: 1.324 ± 0.0
3.177TyrArg: 3.177 ± 0.0
2.383TyrSer: 2.383 ± 0.0
2.383TyrThr: 2.383 ± 0.0
2.383TyrVal: 2.383 ± 0.0
0.53TyrTrp: 0.53 ± 0.0
0.53TyrTyr: 0.53 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3778 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski