Amino acid dipeptide frequency for Hubei sobemo-like virus 24

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
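As a minimal sketch of how such frequencies could be computed, the Python function below counts overlapping residue pairs inside each protein and normalizes by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter input format, and the toy sequences are illustrative assumptions and not the Proteome-pI implementation; how the database handles pairs at protein boundaries is not stated on this page.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Pairs are counted with a sliding window inside each protein, so no
    dipeptide spans two different proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_frequencies(["AAC", "ACA"])
```

The table on this page uses three-letter codes (AlaCys rather than AC); mapping between the two notations is left out to keep the sketch short.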

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
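The uniform expectation and the over/under comparison can be sketched in a few lines of Python. The names `expected_uniform` and `classify` are hypothetical, and the exact rule the page uses to choose red or blue is not documented here, so the plain greater-than/less-than comparison is an assumption.

```python
def expected_uniform(n_residue_types=20):
    """Expected dipeptide frequency (as a fraction) if all pairs were equally likely."""
    return 1.0 / n_residue_types ** 2

def classify(freq_permille, n_residue_types=20):
    """Compare an observed frequency (per mille) with the uniform expectation."""
    expected_permille = 1000.0 * expected_uniform(n_residue_types)
    if freq_permille > expected_permille:
        return "over"    # would be marked red under this assumed rule
    if freq_permille < expected_permille:
        return "under"   # would be marked blue under this assumed rule
    return "expected"

label_ala_ala = classify(7.184)  # AlaAla value from the table
label_ala_phe = classify(1.437)  # AlaPhe value from the table
```

With 20 residue types the expectation is 2.5‰, so AlaAla (7.184‰) falls above it and AlaPhe (1.437‰) below it; with Xaa included (21 types) the expectation drops to 1/441.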

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
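As a small worked example of the unit conversion, the following Python snippet turns a tabulated per mille value into a fraction and into a percentage so it can be compared with the 0.25% uniform expectation. The helper names are illustrative assumptions.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_permille / 10.0

asp_asp_fraction = permille_to_fraction(15.805)  # AspAsp value from the table
asp_asp_percent = permille_to_percent(15.805)
```

Here the AspAsp entry of 15.805‰ corresponds to a fraction of about 0.0158, or about 1.58%.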

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue follows the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 7.184 ± 3.995
AlaCys: 0.0 ± 0.0
AlaAsp: 5.747 ± 2.767
AlaGlu: 7.184 ± 0.294
AlaPhe: 1.437 ± 1.228
AlaGly: 2.874 ± 1.833
AlaHis: 0.0 ± 0.0
AlaIle: 4.31 ± 0.605
AlaLys: 2.874 ± 2.456
AlaLeu: 2.874 ± 0.311
AlaMet: 1.437 ± 0.917
AlaAsn: 2.874 ± 2.456
AlaPro: 7.184 ± 1.85
AlaGln: 1.437 ± 1.228
AlaArg: 5.747 ± 1.522
AlaSer: 1.437 ± 1.228
AlaThr: 2.874 ± 0.311
AlaVal: 5.747 ± 3.667
AlaTrp: 1.437 ± 0.917
AlaTyr: 5.747 ± 2.767
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.437 ± 0.917
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.437 ± 0.917
CysGln: 0.0 ± 0.0
CysArg: 1.437 ± 0.917
CysSer: 2.874 ± 0.311
CysThr: 0.0 ± 0.0
CysVal: 2.874 ± 0.311
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.874 ± 0.311
AspCys: 0.0 ± 0.0
AspAsp: 15.805 ± 9.218
AspGlu: 8.621 ± 3.078
AspPhe: 7.184 ± 1.85
AspGly: 4.31 ± 2.75
AspHis: 1.437 ± 0.917
AspIle: 1.437 ± 0.917
AspLys: 4.31 ± 1.539
AspLeu: 4.31 ± 0.605
AspMet: 1.437 ± 0.917
AspAsn: 2.874 ± 1.833
AspPro: 4.31 ± 2.75
AspGln: 4.31 ± 0.605
AspArg: 12.931 ± 11.051
AspSer: 4.31 ± 1.539
AspThr: 1.437 ± 0.917
AspVal: 4.31 ± 2.75
AspTrp: 2.874 ± 1.833
AspTyr: 5.747 ± 2.767
AspXaa: 0.0 ± 0.0

Glu
GluAla: 8.621 ± 0.934
GluCys: 0.0 ± 0.0
GluAsp: 7.184 ± 1.85
GluGlu: 7.184 ± 1.85
GluPhe: 0.0 ± 0.0
GluGly: 1.437 ± 0.917
GluHis: 2.874 ± 2.456
GluIle: 4.31 ± 2.75
GluLys: 1.437 ± 1.228
GluLeu: 4.31 ± 3.684
GluMet: 5.747 ± 3.259
GluAsn: 2.874 ± 0.311
GluPro: 5.747 ± 3.667
GluGln: 2.874 ± 0.311
GluArg: 5.747 ± 2.767
GluSer: 2.874 ± 2.456
GluThr: 1.437 ± 1.228
GluVal: 2.874 ± 0.311
GluTrp: 2.874 ± 1.833
GluTyr: 2.874 ± 0.311
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.437 ± 0.917
PheCys: 0.0 ± 0.0
PheAsp: 4.31 ± 3.684
PheGlu: 0.0 ± 0.0
PhePhe: 1.437 ± 1.228
PheGly: 1.437 ± 1.228
PheHis: 1.437 ± 0.917
PheIle: 0.0 ± 0.0
PheLys: 1.437 ± 1.228
PheLeu: 2.874 ± 0.311
PheMet: 1.437 ± 0.917
PheAsn: 0.0 ± 0.0
PhePro: 0.0 ± 0.0
PheGln: 1.437 ± 0.917
PheArg: 2.874 ± 0.311
PheSer: 1.437 ± 0.917
PheThr: 0.0 ± 0.0
PheVal: 2.874 ± 1.833
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.31 ± 3.684
GlyCys: 1.437 ± 0.917
GlyAsp: 4.31 ± 0.605
GlyGlu: 4.31 ± 0.605
GlyPhe: 1.437 ± 0.917
GlyGly: 4.31 ± 0.605
GlyHis: 0.0 ± 0.0
GlyIle: 2.874 ± 0.311
GlyLys: 2.874 ± 0.311
GlyLeu: 4.31 ± 0.605
GlyMet: 0.0 ± 0.0
GlyAsn: 1.437 ± 1.228
GlyPro: 0.0 ± 0.0
GlyGln: 2.874 ± 2.456
GlyArg: 2.874 ± 1.833
GlySer: 5.747 ± 2.767
GlyThr: 0.0 ± 0.0
GlyVal: 2.874 ± 1.833
GlyTrp: 2.874 ± 1.833
GlyTyr: 4.31 ± 0.605
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.437 ± 0.917
HisCys: 1.437 ± 1.228
HisAsp: 1.437 ± 0.917
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.437 ± 0.917
HisHis: 0.0 ± 0.0
HisIle: 1.437 ± 0.917
HisLys: 2.874 ± 1.833
HisLeu: 1.437 ± 1.228
HisMet: 1.437 ± 0.917
HisAsn: 0.0 ± 0.0
HisPro: 2.874 ± 2.456
HisGln: 0.0 ± 0.0
HisArg: 5.747 ± 1.522
HisSer: 1.437 ± 0.917
HisThr: 0.0 ± 0.0
HisVal: 1.437 ± 0.917
HisTrp: 0.0 ± 0.0
HisTyr: 1.437 ± 0.917
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.437 ± 1.228
IleCys: 0.0 ± 0.0
IleAsp: 1.437 ± 0.917
IleGlu: 2.874 ± 1.833
IlePhe: 0.0 ± 0.0
IleGly: 1.437 ± 1.228
IleHis: 1.437 ± 0.917
IleIle: 2.874 ± 0.311
IleLys: 0.0 ± 0.0
IleLeu: 4.31 ± 2.75
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 4.31 ± 0.605
IleGln: 1.437 ± 0.917
IleArg: 2.874 ± 1.833
IleSer: 5.747 ± 1.522
IleThr: 0.0 ± 0.0
IleVal: 2.874 ± 1.833
IleTrp: 1.437 ± 0.917
IleTyr: 2.874 ± 1.833
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.874 ± 1.833
LysCys: 1.437 ± 1.228
LysAsp: 1.437 ± 0.917
LysGlu: 2.874 ± 1.833
LysPhe: 1.437 ± 1.228
LysGly: 7.184 ± 3.995
LysHis: 2.874 ± 0.311
LysIle: 1.437 ± 0.917
LysLys: 7.184 ± 1.85
LysLeu: 2.874 ± 0.311
LysMet: 0.0 ± 0.789
LysAsn: 2.874 ± 0.311
LysPro: 7.184 ± 1.85
LysGln: 2.874 ± 0.311
LysArg: 2.874 ± 0.311
LysSer: 5.747 ± 0.622
LysThr: 4.31 ± 1.539
LysVal: 1.437 ± 1.228
LysTrp: 0.0 ± 0.0
LysTyr: 1.437 ± 1.228
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.184 ± 0.294
LeuCys: 0.0 ± 0.0
LeuAsp: 12.931 ± 2.473
LeuGlu: 5.747 ± 0.622
LeuPhe: 2.874 ± 1.833
LeuGly: 7.184 ± 2.439
LeuHis: 5.747 ± 3.667
LeuIle: 2.874 ± 0.311
LeuLys: 5.747 ± 0.622
LeuLeu: 10.057 ± 4.272
LeuMet: 0.0 ± 0.0
LeuAsn: 1.437 ± 1.228
LeuPro: 0.0 ± 0.0
LeuGln: 1.437 ± 1.228
LeuArg: 5.747 ± 3.667
LeuSer: 5.747 ± 3.667
LeuThr: 1.437 ± 0.917
LeuVal: 4.31 ± 1.539
LeuTrp: 2.874 ± 1.833
LeuTyr: 4.31 ± 2.75
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.437 ± 0.917
MetCys: 2.874 ± 1.833
MetAsp: 1.437 ± 1.228
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 1.437 ± 0.917
MetHis: 1.437 ± 0.917
MetIle: 1.437 ± 0.917
MetLys: 2.874 ± 0.311
MetLeu: 5.747 ± 1.522
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 1.437 ± 0.917
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 2.874 ± 2.456
MetVal: 0.0 ± 0.0
MetTrp: 1.437 ± 0.917
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.437 ± 1.228
AsnCys: 0.0 ± 0.0
AsnAsp: 1.437 ± 1.228
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.437 ± 0.917
AsnGly: 2.874 ± 2.456
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 0.0 ± 0.0
AsnLeu: 4.31 ± 1.539
AsnMet: 1.437 ± 0.917
AsnAsn: 0.0 ± 0.0
AsnPro: 2.874 ± 0.311
AsnGln: 1.437 ± 1.228
AsnArg: 2.874 ± 2.456
AsnSer: 2.874 ± 1.833
AsnThr: 0.0 ± 0.0
AsnVal: 1.437 ± 1.228
AsnTrp: 1.437 ± 0.917
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.184 ± 0.294
ProCys: 0.0 ± 0.0
ProAsp: 4.31 ± 2.75
ProGlu: 2.874 ± 0.311
ProPhe: 0.0 ± 0.0
ProGly: 5.747 ± 0.622
ProHis: 1.437 ± 0.917
ProIle: 1.437 ± 0.917
ProLys: 4.31 ± 1.539
ProLeu: 8.621 ± 3.356
ProMet: 0.0 ± 0.0
ProAsn: 2.874 ± 0.311
ProPro: 8.621 ± 0.934
ProGln: 1.437 ± 1.228
ProArg: 1.437 ± 0.917
ProSer: 7.184 ± 0.294
ProThr: 4.31 ± 0.605
ProVal: 1.437 ± 1.228
ProTrp: 1.437 ± 0.917
ProTyr: 1.437 ± 0.917
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.31 ± 3.684
GlnCys: 1.437 ± 0.917
GlnAsp: 1.437 ± 0.917
GlnGlu: 5.747 ± 0.622
GlnPhe: 0.0 ± 0.0
GlnGly: 0.0 ± 0.0
GlnHis: 1.437 ± 1.228
GlnIle: 1.437 ± 0.917
GlnLys: 5.747 ± 0.622
GlnLeu: 2.874 ± 1.833
GlnMet: 0.0 ± 0.0
GlnAsn: 1.437 ± 1.228
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 1.437 ± 0.917
GlnSer: 1.437 ± 1.228
GlnThr: 0.0 ± 0.0
GlnVal: 2.874 ± 1.833
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.874 ± 1.833
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.31 ± 0.605
ArgCys: 0.0 ± 0.0
ArgAsp: 8.621 ± 3.078
ArgGlu: 4.31 ± 3.684
ArgPhe: 4.31 ± 1.539
ArgGly: 2.874 ± 0.311
ArgHis: 1.437 ± 0.917
ArgIle: 4.31 ± 2.75
ArgLys: 5.747 ± 0.622
ArgLeu: 7.184 ± 2.439
ArgMet: 5.747 ± 1.522
ArgAsn: 2.874 ± 2.456
ArgPro: 4.31 ± 0.605
ArgGln: 1.437 ± 0.917
ArgArg: 2.874 ± 0.311
ArgSer: 2.874 ± 1.833
ArgThr: 1.437 ± 0.917
ArgVal: 7.184 ± 3.995
ArgTrp: 7.184 ± 0.294
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.31 ± 1.539
SerCys: 0.0 ± 0.0
SerAsp: 7.184 ± 0.294
SerGlu: 2.874 ± 2.456
SerPhe: 1.437 ± 0.917
SerGly: 5.747 ± 0.622
SerHis: 0.0 ± 0.0
SerIle: 2.874 ± 0.311
SerLys: 1.437 ± 1.228
SerLeu: 8.621 ± 3.356
SerMet: 1.437 ± 0.917
SerAsn: 1.437 ± 0.917
SerPro: 7.184 ± 0.294
SerGln: 2.874 ± 1.833
SerArg: 5.747 ± 0.622
SerSer: 10.057 ± 4.306
SerThr: 2.874 ± 2.456
SerVal: 4.31 ± 0.605
SerTrp: 1.437 ± 1.228
SerTyr: 1.437 ± 0.917
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 0.0 ± 0.0
ThrCys: 0.0 ± 0.0
ThrAsp: 1.437 ± 0.917
ThrGlu: 4.31 ± 1.539
ThrPhe: 0.0 ± 0.0
ThrGly: 0.0 ± 0.0
ThrHis: 1.437 ± 0.917
ThrIle: 1.437 ± 0.917
ThrLys: 0.0 ± 0.0
ThrLeu: 1.437 ± 0.917
ThrMet: 0.0 ± 0.0
ThrAsn: 2.874 ± 0.311
ThrPro: 2.874 ± 0.311
ThrGln: 1.437 ± 1.228
ThrArg: 1.437 ± 1.228
ThrSer: 7.184 ± 0.294
ThrThr: 2.874 ± 0.311
ThrVal: 5.747 ± 1.522
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0

Val
ValAla: 1.437 ± 0.917
ValCys: 0.0 ± 0.0
ValAsp: 7.184 ± 2.439
ValGlu: 7.184 ± 3.995
ValPhe: 1.437 ± 0.917
ValGly: 1.437 ± 1.228
ValHis: 1.437 ± 1.228
ValIle: 0.0 ± 0.0
ValLys: 10.057 ± 0.017
ValLeu: 2.874 ± 0.311
ValMet: 1.437 ± 0.917
ValAsn: 0.0 ± 0.0
ValPro: 4.31 ± 2.75
ValGln: 2.874 ± 1.833
ValArg: 8.621 ± 1.211
ValSer: 2.874 ± 0.311
ValThr: 2.874 ± 1.833
ValVal: 5.747 ± 0.622
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 4.31 ± 1.539
TrpCys: 0.0 ± 0.0
TrpAsp: 2.874 ± 0.311
TrpGlu: 4.31 ± 2.75
TrpPhe: 0.0 ± 0.0
TrpGly: 1.437 ± 0.917
TrpHis: 0.0 ± 0.0
TrpIle: 1.437 ± 0.917
TrpLys: 1.437 ± 0.917
TrpLeu: 1.437 ± 0.917
TrpMet: 1.437 ± 0.917
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 4.31 ± 2.75
TrpSer: 1.437 ± 1.228
TrpThr: 4.31 ± 2.75
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 4.31 ± 0.605
TyrCys: 1.437 ± 0.917
TyrAsp: 4.31 ± 0.605
TyrGlu: 2.874 ± 2.456
TyrPhe: 0.0 ± 0.0
TyrGly: 0.0 ± 0.0
TyrHis: 1.437 ± 0.917
TyrIle: 0.0 ± 0.0
TyrLys: 1.437 ± 1.228
TyrLeu: 7.184 ± 2.439
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 2.874 ± 1.833
TyrGln: 2.874 ± 1.833
TyrArg: 1.437 ± 1.228
TyrSer: 0.0 ± 0.0
TyrThr: 1.437 ± 0.917
TyrVal: 1.437 ± 1.228
TyrTrp: 1.437 ± 1.228
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (697 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
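A protein-level bootstrap of this kind can be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the population standard deviation as the error are assumptions of this sketch; the page does not specify which statistic Proteome-pI reports.

```python
import random
from collections import Counter
from statistics import pstdev

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return pstdev(estimates)

# Toy proteome (hypothetical sequences, not the viral proteins).
toy_error = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With only 2 proteins, as in this proteome, each resample can contain just a few distinct protein combinations, so the resulting error estimates are coarse.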

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski