Amino acid dipeptide frequency for Hubei sobemo-like virus 41

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
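As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping adjacent residue pairs and reports them in per mille. It assumes one-letter sequences, that pairs never span two proteins, and that the denominator is the total number of dipeptides; the exact normalization used for the table is not stated here, and the function names and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_counts(sequence):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))


def dipeptide_frequencies(sequences):
    """Pool the counts over all proteins and return frequencies in per mille.

    Assumption: the denominator is the total number of dipeptides observed.
    """
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * count / n for pair, count in total.items()}


# Toy input (hypothetical sequences, not the proteome of this virus).
toy_proteome = ["MKLLA", "GKLL"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting pairs per protein and pooling afterwards keeps the last residue of one protein from being paired with the first residue of the next.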

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides occurring more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
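The arithmetic behind the expected value can be sketched as follows. The uniform expectation is simply 1 divided by the number of possible dipeptides. The simple "above or below the uniform expectation" rule is an assumption made for illustration, since the exact threshold used for the red and blue marking is not given here. The function names are hypothetical.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


expected_20 = expected_uniform_per_mille(20)   # 400 dipeptides
expected_21 = expected_uniform_per_mille(21)   # 441 dipeptides, with Xaa

# Two values taken from the table: LeuLeu and AlaHis.
print(expected_20, round(expected_21, 3))
print("LeuLeu", classify(12.972, expected_20))
print("AlaHis", classify(0.0, expected_20))
```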

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.896 ± 3.597
AlaCys: 1.179 ± 0.923
AlaAsp: 3.538 ± 0.516
AlaGlu: 7.075 ± 0.61
AlaPhe: 1.179 ± 0.923
AlaGly: 5.896 ± 0.313
AlaHis: 0.0 ± 0.0
AlaIle: 4.717 ± 1.235
AlaLys: 7.075 ± 2.252
AlaLeu: 4.717 ± 0.407
AlaMet: 2.358 ± 0.203
AlaAsn: 1.179 ± 0.719
AlaPro: 4.717 ± 0.407
AlaGln: 4.717 ± 0.407
AlaArg: 2.358 ± 1.439
AlaSer: 2.358 ± 0.203
AlaThr: 2.358 ± 0.203
AlaVal: 2.358 ± 0.203
AlaTrp: 2.358 ± 0.203
AlaTyr: 2.358 ± 0.203
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.179 ± 0.719
CysGlu: 1.179 ± 0.719
CysPhe: 0.0 ± 0.0
CysGly: 1.179 ± 0.923
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 2.358 ± 1.439
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.179 ± 0.923
CysGln: 1.179 ± 0.719
CysArg: 3.538 ± 1.126
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.179 ± 0.923
CysTrp: 0.0 ± 0.0
CysTyr: 1.179 ± 0.923
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.717 ± 0.407
AspCys: 0.0 ± 0.0
AspAsp: 2.358 ± 1.845
AspGlu: 3.538 ± 1.126
AspPhe: 0.0 ± 0.0
AspGly: 1.179 ± 0.719
AspHis: 0.0 ± 0.0
AspIle: 3.538 ± 1.126
AspLys: 2.358 ± 1.439
AspLeu: 2.358 ± 1.439
AspMet: 0.0 ± 0.0
AspAsn: 2.358 ± 0.203
AspPro: 2.358 ± 0.203
AspGln: 2.358 ± 1.439
AspArg: 1.179 ± 0.719
AspSer: 5.896 ± 0.313
AspThr: 2.358 ± 1.439
AspVal: 4.717 ± 0.407
AspTrp: 3.538 ± 1.126
AspTyr: 3.538 ± 0.516
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.896 ± 0.313
GluCys: 1.179 ± 0.923
GluAsp: 2.358 ± 1.439
GluGlu: 1.179 ± 0.719
GluPhe: 3.538 ± 0.516
GluGly: 2.358 ± 1.439
GluHis: 1.179 ± 0.923
GluIle: 3.538 ± 1.126
GluLys: 0.0 ± 0.0
GluLeu: 8.255 ± 3.394
GluMet: 0.0 ± 0.0
GluAsn: 1.179 ± 0.923
GluPro: 3.538 ± 1.126
GluGln: 3.538 ± 0.516
GluArg: 4.717 ± 1.235
GluSer: 7.075 ± 1.032
GluThr: 3.538 ± 0.516
GluVal: 3.538 ± 1.126
GluTrp: 1.179 ± 0.719
GluTyr: 3.538 ± 2.158
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.179 ± 0.923
PheAsp: 1.179 ± 0.923
PheGlu: 2.358 ± 0.203
PhePhe: 0.0 ± 0.0
PheGly: 4.717 ± 0.407
PheHis: 0.0 ± 0.0
PheIle: 2.358 ± 1.845
PheLys: 0.0 ± 0.0
PheLeu: 0.0 ± 0.0
PheMet: 3.538 ± 2.768
PheAsn: 2.358 ± 1.845
PhePro: 0.0 ± 0.0
PheGln: 0.0 ± 0.0
PheArg: 2.358 ± 0.203
PheSer: 5.896 ± 0.313
PheThr: 1.179 ± 0.923
PheVal: 2.358 ± 1.845
PheTrp: 3.538 ± 0.516
PheTyr: 1.179 ± 0.923
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.717 ± 2.049
GlyCys: 2.358 ± 0.203
GlyAsp: 1.179 ± 0.923
GlyGlu: 3.538 ± 2.158
GlyPhe: 5.896 ± 0.313
GlyGly: 9.434 ± 2.471
GlyHis: 2.358 ± 0.203
GlyIle: 0.0 ± 0.0
GlyLys: 8.255 ± 1.752
GlyLeu: 5.896 ± 1.329
GlyMet: 2.358 ± 0.48
GlyAsn: 2.358 ± 1.439
GlyPro: 1.179 ± 0.719
GlyGln: 4.717 ± 1.235
GlyArg: 3.538 ± 2.158
GlySer: 3.538 ± 1.126
GlyThr: 1.179 ± 0.719
GlyVal: 7.075 ± 1.032
GlyTrp: 3.538 ± 2.768
GlyTyr: 5.896 ± 1.329
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.179 ± 0.719
HisCys: 1.179 ± 0.719
HisAsp: 2.358 ± 0.203
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 2.358 ± 0.203
HisHis: 1.179 ± 0.719
HisIle: 1.179 ± 0.719
HisLys: 2.358 ± 1.845
HisLeu: 5.896 ± 1.329
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.179 ± 0.719
HisGln: 0.0 ± 0.0
HisArg: 1.179 ± 0.923
HisSer: 3.538 ± 1.126
HisThr: 0.0 ± 0.0
HisVal: 1.179 ± 0.719
HisTrp: 1.179 ± 0.923
HisTyr: 2.358 ± 0.203
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.075 ± 5.536
IleCys: 0.0 ± 0.0
IleAsp: 1.179 ± 0.719
IleGlu: 2.358 ± 1.439
IlePhe: 1.179 ± 0.923
IleGly: 4.717 ± 1.235
IleHis: 2.358 ± 0.203
IleIle: 2.358 ± 1.845
IleLys: 0.0 ± 0.0
IleLeu: 1.179 ± 0.719
IleMet: 0.0 ± 0.0
IleAsn: 2.358 ± 1.439
IlePro: 5.896 ± 1.955
IleGln: 0.0 ± 0.0
IleArg: 2.358 ± 0.203
IleSer: 3.538 ± 1.126
IleThr: 1.179 ± 0.719
IleVal: 4.717 ± 1.235
IleTrp: 0.0 ± 0.0
IleTyr: 2.358 ± 0.203
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.179 ± 0.923
LysCys: 1.179 ± 0.719
LysAsp: 2.358 ± 0.203
LysGlu: 8.255 ± 1.533
LysPhe: 2.358 ± 0.203
LysGly: 2.358 ± 0.203
LysHis: 3.538 ± 1.126
LysIle: 7.075 ± 2.674
LysLys: 0.0 ± 0.0
LysLeu: 4.717 ± 0.407
LysMet: 0.0 ± 0.0
LysAsn: 1.179 ± 0.719
LysPro: 3.538 ± 0.516
LysGln: 1.179 ± 0.719
LysArg: 1.179 ± 0.719
LysSer: 9.434 ± 0.813
LysThr: 2.358 ± 0.203
LysVal: 2.358 ± 1.439
LysTrp: 1.179 ± 0.923
LysTyr: 2.358 ± 1.845
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.792 ± 0.625
LeuCys: 0.0 ± 0.0
LeuAsp: 5.896 ± 0.313
LeuGlu: 4.717 ± 1.235
LeuPhe: 4.717 ± 3.691
LeuGly: 4.717 ± 0.407
LeuHis: 1.179 ± 0.719
LeuIle: 1.179 ± 0.923
LeuLys: 7.075 ± 4.316
LeuLeu: 12.972 ± 2.987
LeuMet: 4.717 ± 2.878
LeuAsn: 3.538 ± 1.126
LeuPro: 3.538 ± 1.126
LeuGln: 2.358 ± 1.439
LeuArg: 5.896 ± 0.313
LeuSer: 1.179 ± 0.719
LeuThr: 4.717 ± 0.407
LeuVal: 3.538 ± 2.158
LeuTrp: 1.179 ± 0.923
LeuTyr: 7.075 ± 0.61
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.179 ± 0.923
MetCys: 0.0 ± 0.0
MetAsp: 2.358 ± 0.203
MetGlu: 1.179 ± 0.719
MetPhe: 0.0 ± 0.0
MetGly: 3.538 ± 1.126
MetHis: 3.538 ± 0.516
MetIle: 0.0 ± 0.0
MetLys: 2.358 ± 1.845
MetLeu: 1.179 ± 0.923
MetMet: 1.179 ± 0.923
MetAsn: 2.358 ± 1.439
MetPro: 1.179 ± 0.719
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 2.358 ± 1.439
MetThr: 1.179 ± 0.923
MetVal: 3.538 ± 1.126
MetTrp: 0.0 ± 0.0
MetTyr: 3.538 ± 1.126
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.179 ± 0.719
AsnCys: 0.0 ± 0.0
AsnAsp: 2.358 ± 1.439
AsnGlu: 2.358 ± 0.203
AsnPhe: 1.179 ± 0.923
AsnGly: 7.075 ± 2.674
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 3.538 ± 0.516
AsnLeu: 2.358 ± 0.203
AsnMet: 1.179 ± 0.923
AsnAsn: 0.0 ± 0.0
AsnPro: 0.0 ± 0.0
AsnGln: 0.0 ± 0.0
AsnArg: 3.538 ± 0.516
AsnSer: 3.538 ± 1.126
AsnThr: 1.179 ± 0.923
AsnVal: 2.358 ± 0.203
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.358 ± 0.203
ProCys: 1.179 ± 0.719
ProAsp: 1.179 ± 0.719
ProGlu: 2.358 ± 0.203
ProPhe: 1.179 ± 0.923
ProGly: 8.255 ± 1.752
ProHis: 1.179 ± 0.923
ProIle: 0.0 ± 0.0
ProLys: 2.358 ± 1.439
ProLeu: 4.717 ± 1.235
ProMet: 2.358 ± 0.203
ProAsn: 0.0 ± 0.0
ProPro: 1.179 ± 0.719
ProGln: 1.179 ± 0.719
ProArg: 4.717 ± 1.235
ProSer: 1.179 ± 0.923
ProThr: 3.538 ± 0.516
ProVal: 4.717 ± 2.049
ProTrp: 0.0 ± 0.0
ProTyr: 2.358 ± 0.203
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.896 ± 1.955
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 3.538 ± 2.158
GlnPhe: 2.358 ± 1.845
GlnGly: 1.179 ± 0.719
GlnHis: 0.0 ± 0.0
GlnIle: 1.179 ± 0.719
GlnLys: 3.538 ± 0.516
GlnLeu: 3.538 ± 0.516
GlnMet: 2.358 ± 0.203
GlnAsn: 1.179 ± 0.923
GlnPro: 2.358 ± 1.439
GlnGln: 1.179 ± 0.719
GlnArg: 1.179 ± 0.719
GlnSer: 2.358 ± 0.203
GlnThr: 0.0 ± 0.0
GlnVal: 3.538 ± 0.516
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.896 ± 0.313
ArgCys: 0.0 ± 0.0
ArgAsp: 4.717 ± 1.235
ArgGlu: 3.538 ± 1.126
ArgPhe: 1.179 ± 0.923
ArgGly: 2.358 ± 1.439
ArgHis: 0.0 ± 0.0
ArgIle: 2.358 ± 0.203
ArgLys: 0.0 ± 0.0
ArgLeu: 9.434 ± 2.471
ArgMet: 2.358 ± 1.845
ArgAsn: 2.358 ± 1.439
ArgPro: 2.358 ± 1.439
ArgGln: 2.358 ± 0.203
ArgArg: 7.075 ± 4.316
ArgSer: 3.538 ± 0.516
ArgThr: 3.538 ± 0.516
ArgVal: 5.896 ± 0.313
ArgTrp: 3.538 ± 1.126
ArgTyr: 1.179 ± 0.923
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.358 ± 0.203
SerCys: 0.0 ± 0.0
SerAsp: 3.538 ± 0.516
SerGlu: 1.179 ± 0.719
SerPhe: 2.358 ± 0.203
SerGly: 11.792 ± 1.017
SerHis: 1.179 ± 0.719
SerIle: 1.179 ± 0.719
SerLys: 7.075 ± 0.61
SerLeu: 5.896 ± 1.329
SerMet: 0.0 ± 0.0
SerAsn: 4.717 ± 1.235
SerPro: 4.717 ± 0.407
SerGln: 2.358 ± 1.439
SerArg: 7.075 ± 2.252
SerSer: 11.792 ± 4.301
SerThr: 3.538 ± 0.516
SerVal: 2.358 ± 1.439
SerTrp: 2.358 ± 1.845
SerTyr: 4.717 ± 0.407
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.0 ± 0.0
ThrCys: 0.0 ± 0.0
ThrAsp: 2.358 ± 1.845
ThrGlu: 3.538 ± 0.516
ThrPhe: 0.0 ± 0.0
ThrGly: 2.358 ± 1.845
ThrHis: 2.358 ± 1.439
ThrIle: 4.717 ± 0.407
ThrLys: 2.358 ± 1.845
ThrLeu: 3.538 ± 1.126
ThrMet: 1.179 ± 0.923
ThrAsn: 0.0 ± 0.0
ThrPro: 2.358 ± 1.439
ThrGln: 3.538 ± 0.516
ThrArg: 2.358 ± 1.439
ThrSer: 3.538 ± 2.158
ThrThr: 0.0 ± 0.0
ThrVal: 3.538 ± 0.516
ThrTrp: 1.179 ± 0.719
ThrTyr: 1.179 ± 0.719
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.179 ± 0.719
ValCys: 1.179 ± 0.719
ValAsp: 3.538 ± 2.158
ValGlu: 5.896 ± 1.955
ValPhe: 2.358 ± 1.845
ValGly: 2.358 ± 1.845
ValHis: 1.179 ± 0.923
ValIle: 3.538 ± 0.516
ValLys: 5.896 ± 1.329
ValLeu: 4.717 ± 1.235
ValMet: 2.358 ± 0.438
ValAsn: 3.538 ± 1.126
ValPro: 2.358 ± 0.203
ValGln: 3.538 ± 1.126
ValArg: 5.896 ± 0.313
ValSer: 5.896 ± 0.313
ValThr: 5.896 ± 1.955
ValVal: 4.717 ± 2.049
ValTrp: 1.179 ± 0.923
ValTyr: 1.179 ± 0.719
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.538 ± 2.158
TrpCys: 1.179 ± 0.923
TrpAsp: 1.179 ± 0.923
TrpGlu: 0.0 ± 0.0
TrpPhe: 3.538 ± 1.126
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.179 ± 0.923
TrpLys: 1.179 ± 0.719
TrpLeu: 1.179 ± 0.923
TrpMet: 1.179 ± 0.923
TrpAsn: 1.179 ± 0.923
TrpPro: 1.179 ± 0.923
TrpGln: 0.0 ± 0.0
TrpArg: 3.538 ± 2.768
TrpSer: 1.179 ± 0.719
TrpThr: 0.0 ± 0.0
TrpVal: 2.358 ± 0.203
TrpTrp: 0.0 ± 0.0
TrpTyr: 2.358 ± 1.845
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.358 ± 0.203
TyrCys: 2.358 ± 0.203
TyrAsp: 3.538 ± 1.126
TyrGlu: 3.538 ± 0.516
TyrPhe: 1.179 ± 0.719
TyrGly: 1.179 ± 0.923
TyrHis: 7.075 ± 2.252
TyrIle: 4.717 ± 1.235
TyrLys: 2.358 ± 1.845
TyrLeu: 7.075 ± 1.032
TyrMet: 2.358 ± 1.439
TyrAsn: 0.0 ± 0.0
TyrPro: 1.179 ± 0.923
TyrGln: 1.179 ± 0.923
TyrArg: 0.0 ± 0.0
TyrSer: 2.358 ± 1.845
TyrThr: 2.358 ± 1.845
TyrVal: 3.538 ± 0.516
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.179 ± 0.923
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (849 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
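A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. Using the sample standard deviation as the error measure, the fixed random seed, the function names, and the toy sequences are all assumptions made for illustration; only the 100 replicates and the protein-level resampling come from the note above.

```python
import random
from collections import Counter
from statistics import stdev


def pair_frequency(sequences, pair):
    """Frequency of one dipeptide (per mille) pooled over the given proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the reported error is the standard deviation over replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return stdev(estimates)


# Toy input (hypothetical sequences, not the proteome of this virus).
toy_proteome = ["MKLLA", "GKLL"]
point = pair_frequency(toy_proteome, "LA")
error = bootstrap_error(toy_proteome, "LA")
print(f"LA: {point:.3f} ± {error:.3f}")
```

With only two proteins, as in this proteome, each replicate can take very few distinct compositions, so the resulting error estimates are coarse.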

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski