Amino acid dipeptide frequency for Hubei sobemo-like virus 25

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
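As an illustration, dipeptide frequencies can be computed along the following lines. This is a minimal sketch, not the code used to build this page: it assumes that overlapping pairs are counted within each protein and normalized by the total number of pairs, and that any residue outside the 20 standard ones is mapped to Xaa. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three(residue):
    """Map a one-letter code to its three-letter code (assumption: unknowns become Xaa)."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs are counted within a protein, never across proteins.
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Hypothetical toy sequences, not the proteins of this proteome.
freqs = dipeptide_frequencies(["MAAT", "AAXK"])
print(len(freqs), freqs["AlaAla"], freqs["AlaXaa"])
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table, where absent dipeptides are listed with a frequency of 0.0.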

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
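The uniform expectation and the per mille units described above can be sketched as follows. The threshold of 1/400 follows from the text; whether the page colours cells by exactly this rule is an assumption, and the function names are hypothetical. The two example values are taken from the AlaAla and AlaCys cells of the table.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.0025, i.e. 0.25 % or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed colouring rule)."""
    if value_per_mille > expected:
        return "over"    # would be marked in red
    if value_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


for name, value in [("AlaAla", 9.625), ("AlaCys", 1.925)]:
    print(name, per_mille_to_fraction(value), classify(value))
```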

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.625 ± 2.329
AlaCys: 1.925 ± 1.026
AlaAsp: 3.85 ± 0.75
AlaGlu: 4.812 ± 1.165
AlaPhe: 0.962 ± 0.888
AlaGly: 5.775 ± 1.678
AlaHis: 0.962 ± 0.513
AlaIle: 3.85 ± 2.053
AlaLys: 0.962 ± 0.513
AlaLeu: 5.775 ± 1.678
AlaMet: 0.962 ± 0.888
AlaAsn: 0.0 ± 0.0
AlaPro: 2.887 ± 0.138
AlaGln: 3.85 ± 0.75
AlaArg: 0.962 ± 0.888
AlaSer: 3.85 ± 2.053
AlaThr: 10.587 ± 4.244
AlaVal: 5.775 ± 3.079
AlaTrp: 0.962 ± 0.513
AlaTyr: 1.925 ± 0.375
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 3.85 ± 0.651
CysGlu: 0.0 ± 0.0
CysPhe: 0.962 ± 0.888
CysGly: 0.962 ± 0.513
CysHis: 0.0 ± 0.0
CysIle: 0.962 ± 0.888
CysLys: 1.925 ± 1.026
CysLeu: 0.962 ± 0.513
CysMet: 0.962 ± 0.888
CysAsn: 0.962 ± 0.513
CysPro: 0.962 ± 0.513
CysGln: 0.962 ± 0.888
CysArg: 1.925 ± 0.375
CysSer: 2.887 ± 1.54
CysThr: 1.925 ± 1.026
CysVal: 0.962 ± 0.513
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.0 ± 0.0
AspCys: 0.962 ± 0.513
AspAsp: 5.775 ± 0.277
AspGlu: 4.812 ± 0.237
AspPhe: 1.925 ± 1.776
AspGly: 1.925 ± 0.375
AspHis: 0.962 ± 0.888
AspIle: 0.962 ± 0.513
AspLys: 2.887 ± 2.665
AspLeu: 3.85 ± 0.651
AspMet: 2.887 ± 1.263
AspAsn: 4.812 ± 0.237
AspPro: 5.775 ± 1.125
AspGln: 2.887 ± 1.263
AspArg: 1.925 ± 1.776
AspSer: 0.962 ± 0.888
AspThr: 2.887 ± 0.138
AspVal: 4.812 ± 0.237
AspTrp: 1.925 ± 0.375
AspTyr: 0.962 ± 0.888
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.775 ± 1.678
GluCys: 0.962 ± 0.513
GluAsp: 2.887 ± 0.138
GluGlu: 6.737 ± 0.79
GluPhe: 0.962 ± 0.888
GluGly: 6.737 ± 0.79
GluHis: 1.925 ± 0.375
GluIle: 1.925 ± 0.375
GluLys: 3.85 ± 0.75
GluLeu: 3.85 ± 0.651
GluMet: 1.925 ± 0.375
GluAsn: 0.0 ± 0.0
GluPro: 2.887 ± 1.263
GluGln: 0.962 ± 0.888
GluArg: 2.887 ± 1.54
GluSer: 9.625 ± 3.731
GluThr: 3.85 ± 0.651
GluVal: 1.925 ± 0.375
GluTrp: 0.962 ± 0.888
GluTyr: 1.925 ± 0.375
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.887 ± 0.138
PheCys: 0.0 ± 0.0
PheAsp: 0.0 ± 0.0
PheGlu: 1.925 ± 0.375
PhePhe: 0.962 ± 0.513
PheGly: 2.887 ± 1.263
PheHis: 0.962 ± 0.888
PheIle: 2.887 ± 1.263
PheLys: 0.0 ± 0.0
PheLeu: 3.85 ± 0.651
PheMet: 0.0 ± 0.0
PheAsn: 1.925 ± 0.375
PhePro: 0.0 ± 0.0
PheGln: 0.0 ± 0.0
PheArg: 0.962 ± 0.888
PheSer: 0.962 ± 0.513
PheThr: 3.85 ± 0.651
PheVal: 4.812 ± 0.237
PheTrp: 0.0 ± 0.0
PheTyr: 1.925 ± 1.026
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.7 ± 4.106
GlyCys: 0.962 ± 0.888
GlyAsp: 1.925 ± 1.776
GlyGlu: 2.887 ± 0.138
GlyPhe: 4.812 ± 2.566
GlyGly: 4.812 ± 1.165
GlyHis: 4.812 ± 3.039
GlyIle: 2.887 ± 1.54
GlyLys: 1.925 ± 0.375
GlyLeu: 2.887 ± 0.138
GlyMet: 1.925 ± 1.026
GlyAsn: 0.962 ± 0.888
GlyPro: 1.925 ± 1.026
GlyGln: 1.925 ± 0.375
GlyArg: 4.812 ± 2.566
GlySer: 4.812 ± 2.566
GlyThr: 4.812 ± 2.566
GlyVal: 6.737 ± 0.79
GlyTrp: 2.887 ± 2.665
GlyTyr: 4.812 ± 3.039
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.887 ± 1.263
HisCys: 0.962 ± 0.513
HisAsp: 0.0 ± 0.0
HisGlu: 0.962 ± 0.888
HisPhe: 0.962 ± 0.888
HisGly: 1.925 ± 0.375
HisHis: 0.962 ± 0.513
HisIle: 0.0 ± 0.0
HisLys: 1.925 ± 0.375
HisLeu: 1.925 ± 1.776
HisMet: 0.0 ± 0.569
HisAsn: 0.0 ± 0.0
HisPro: 3.85 ± 0.651
HisGln: 0.962 ± 0.513
HisArg: 3.85 ± 0.75
HisSer: 1.925 ± 1.026
HisThr: 0.0 ± 0.0
HisVal: 1.925 ± 1.026
HisTrp: 0.0 ± 0.0
HisTyr: 1.925 ± 1.776
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.962 ± 0.513
IleCys: 0.962 ± 0.513
IleAsp: 0.0 ± 0.0
IleGlu: 1.925 ± 0.375
IlePhe: 2.887 ± 0.138
IleGly: 1.925 ± 1.026
IleHis: 1.925 ± 0.375
IleIle: 0.962 ± 0.888
IleLys: 1.925 ± 0.375
IleLeu: 2.887 ± 0.138
IleMet: 0.962 ± 0.888
IleAsn: 0.962 ± 0.888
IlePro: 4.812 ± 1.638
IleGln: 2.887 ± 0.138
IleArg: 4.812 ± 1.638
IleSer: 2.887 ± 0.138
IleThr: 0.0 ± 0.0
IleVal: 3.85 ± 2.053
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.775 ± 3.079
LysCys: 0.962 ± 0.513
LysAsp: 3.85 ± 0.651
LysGlu: 0.962 ± 0.888
LysPhe: 0.962 ± 0.513
LysGly: 2.887 ± 0.138
LysHis: 1.925 ± 0.375
LysIle: 0.0 ± 0.0
LysLys: 6.737 ± 0.79
LysLeu: 7.7 ± 1.5
LysMet: 1.925 ± 1.776
LysAsn: 0.962 ± 0.513
LysPro: 3.85 ± 2.151
LysGln: 0.962 ± 0.513
LysArg: 2.887 ± 1.54
LysSer: 3.85 ± 2.151
LysThr: 5.775 ± 3.079
LysVal: 0.962 ± 0.513
LysTrp: 0.0 ± 0.0
LysTyr: 0.962 ± 0.513
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.587 ± 0.04
LeuCys: 0.962 ± 0.888
LeuAsp: 3.85 ± 0.75
LeuGlu: 9.625 ± 0.473
LeuPhe: 2.887 ± 1.263
LeuGly: 8.662 ± 1.816
LeuHis: 0.962 ± 0.888
LeuIle: 5.775 ± 0.277
LeuLys: 4.812 ± 1.638
LeuLeu: 8.662 ± 0.415
LeuMet: 0.962 ± 0.888
LeuAsn: 4.812 ± 1.638
LeuPro: 1.925 ± 1.026
LeuGln: 1.925 ± 1.776
LeuArg: 6.737 ± 0.79
LeuSer: 1.925 ± 0.375
LeuThr: 2.887 ± 0.138
LeuVal: 4.812 ± 2.566
LeuTrp: 2.887 ± 1.263
LeuTyr: 6.737 ± 2.013
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.962 ± 0.888
MetCys: 0.962 ± 0.888
MetAsp: 0.962 ± 0.888
MetGlu: 2.887 ± 1.54
MetPhe: 0.962 ± 0.513
MetGly: 2.887 ± 1.263
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 6.737 ± 2.013
MetMet: 0.962 ± 0.888
MetAsn: 0.962 ± 0.513
MetPro: 2.887 ± 0.138
MetGln: 0.0 ± 0.0
MetArg: 0.962 ± 0.888
MetSer: 1.925 ± 0.375
MetThr: 0.962 ± 0.513
MetVal: 3.85 ± 0.75
MetTrp: 0.0 ± 0.0
MetTyr: 1.925 ± 0.375
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.925 ± 1.026
AsnCys: 0.962 ± 0.888
AsnAsp: 0.962 ± 0.888
AsnGlu: 2.887 ± 0.138
AsnPhe: 1.925 ± 1.026
AsnGly: 0.962 ± 0.513
AsnHis: 1.925 ± 1.026
AsnIle: 0.962 ± 0.888
AsnLys: 3.85 ± 0.651
AsnLeu: 2.887 ± 1.263
AsnMet: 0.962 ± 0.513
AsnAsn: 0.962 ± 0.513
AsnPro: 0.962 ± 0.888
AsnGln: 2.887 ± 1.54
AsnArg: 0.962 ± 0.888
AsnSer: 0.0 ± 0.0
AsnThr: 1.925 ± 1.776
AsnVal: 0.0 ± 0.0
AsnTrp: 0.962 ± 0.888
AsnTyr: 1.925 ± 1.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.887 ± 1.54
ProCys: 1.925 ± 1.026
ProAsp: 5.775 ± 2.526
ProGlu: 2.887 ± 1.263
ProPhe: 1.925 ± 1.026
ProGly: 4.812 ± 1.638
ProHis: 2.887 ± 1.263
ProIle: 1.925 ± 1.776
ProLys: 4.812 ± 2.566
ProLeu: 4.812 ± 3.039
ProMet: 0.962 ± 0.513
ProAsn: 0.962 ± 0.888
ProPro: 3.85 ± 2.151
ProGln: 0.962 ± 0.513
ProArg: 2.887 ± 1.54
ProSer: 5.775 ± 1.678
ProThr: 0.962 ± 0.888
ProVal: 3.85 ± 0.651
ProTrp: 0.962 ± 0.888
ProTyr: 1.925 ± 1.776
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 0.962 ± 0.513
GlnAsp: 1.925 ± 0.375
GlnGlu: 2.887 ± 0.138
GlnPhe: 0.0 ± 0.0
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 0.962 ± 0.513
GlnLys: 2.887 ± 1.263
GlnLeu: 3.85 ± 0.75
GlnMet: 3.85 ± 0.75
GlnAsn: 1.925 ± 0.375
GlnPro: 2.887 ± 0.138
GlnGln: 3.85 ± 3.553
GlnArg: 2.887 ± 0.138
GlnSer: 0.0 ± 0.0
GlnThr: 2.887 ± 1.263
GlnVal: 2.887 ± 1.263
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.962 ± 0.513
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.925 ± 1.776
ArgCys: 0.962 ± 0.513
ArgAsp: 4.812 ± 3.039
ArgGlu: 0.962 ± 0.888
ArgPhe: 0.962 ± 0.888
ArgGly: 0.962 ± 0.513
ArgHis: 0.962 ± 0.513
ArgIle: 4.812 ± 1.638
ArgLys: 2.887 ± 0.138
ArgLeu: 9.625 ± 1.875
ArgMet: 0.962 ± 0.513
ArgAsn: 2.887 ± 0.138
ArgPro: 3.85 ± 0.651
ArgGln: 0.0 ± 0.0
ArgArg: 6.737 ± 0.612
ArgSer: 6.737 ± 0.79
ArgThr: 3.85 ± 2.053
ArgVal: 7.7 ± 1.303
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.925 ± 1.776
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.85 ± 0.651
SerCys: 2.887 ± 1.54
SerAsp: 6.737 ± 0.612
SerGlu: 5.775 ± 3.079
SerPhe: 1.925 ± 1.026
SerGly: 7.7 ± 0.098
SerHis: 0.962 ± 0.513
SerIle: 1.925 ± 0.375
SerLys: 1.925 ± 1.026
SerLeu: 6.737 ± 2.191
SerMet: 0.962 ± 0.436
SerAsn: 0.962 ± 0.513
SerPro: 4.812 ± 0.237
SerGln: 0.962 ± 0.513
SerArg: 2.887 ± 2.665
SerSer: 8.662 ± 1.816
SerThr: 4.812 ± 2.566
SerVal: 4.812 ± 0.237
SerTrp: 0.962 ± 0.513
SerTyr: 3.85 ± 0.75
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.775 ± 3.079
ThrCys: 0.962 ± 0.513
ThrAsp: 1.925 ± 1.776
ThrGlu: 3.85 ± 2.053
ThrPhe: 1.925 ± 0.375
ThrGly: 4.812 ± 1.165
ThrHis: 0.962 ± 0.513
ThrIle: 0.0 ± 0.0
ThrLys: 1.925 ± 1.026
ThrLeu: 6.737 ± 0.612
ThrMet: 2.887 ± 1.54
ThrAsn: 1.925 ± 0.375
ThrPro: 1.925 ± 0.375
ThrGln: 2.887 ± 1.54
ThrArg: 6.737 ± 0.79
ThrSer: 6.737 ± 2.191
ThrThr: 0.962 ± 0.513
ThrVal: 3.85 ± 2.053
ThrTrp: 0.962 ± 0.888
ThrTyr: 0.962 ± 0.888
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.925 ± 1.026
ValCys: 1.925 ± 0.375
ValAsp: 2.887 ± 0.138
ValGlu: 1.925 ± 1.026
ValPhe: 2.887 ± 1.263
ValGly: 6.737 ± 3.592
ValHis: 2.887 ± 0.138
ValIle: 3.85 ± 0.651
ValLys: 5.775 ± 3.079
ValLeu: 7.7 ± 0.098
ValMet: 1.925 ± 1.026
ValAsn: 1.925 ± 1.026
ValPro: 4.812 ± 1.165
ValGln: 2.887 ± 1.263
ValArg: 3.85 ± 0.651
ValSer: 4.812 ± 1.165
ValThr: 2.887 ± 0.138
ValVal: 7.7 ± 0.098
ValTrp: 2.887 ± 1.54
ValTyr: 1.925 ± 0.375
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.925 ± 0.375
TrpCys: 0.0 ± 0.0
TrpAsp: 1.925 ± 0.375
TrpGlu: 0.962 ± 0.888
TrpPhe: 0.0 ± 0.0
TrpGly: 0.962 ± 0.513
TrpHis: 0.0 ± 0.0
TrpIle: 1.925 ± 1.026
TrpLys: 0.962 ± 0.513
TrpLeu: 0.0 ± 0.0
TrpMet: 0.962 ± 0.888
TrpAsn: 0.962 ± 0.513
TrpPro: 0.962 ± 0.888
TrpGln: 0.962 ± 0.888
TrpArg: 0.0 ± 0.0
TrpSer: 2.887 ± 2.665
TrpThr: 0.962 ± 0.888
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.887 ± 1.54
TyrCys: 0.962 ± 0.888
TyrAsp: 0.0 ± 0.0
TyrGlu: 2.887 ± 0.138
TyrPhe: 0.0 ± 0.0
TyrGly: 2.887 ± 1.263
TyrHis: 1.925 ± 0.375
TyrIle: 0.962 ± 0.888
TyrLys: 1.925 ± 0.375
TyrLeu: 0.962 ± 0.888
TyrMet: 2.887 ± 1.263
TyrAsn: 1.925 ± 1.026
TyrPro: 1.925 ± 1.776
TyrGln: 2.887 ± 1.263
TyrArg: 3.85 ± 3.553
TyrSer: 2.887 ± 1.263
TyrThr: 1.925 ± 1.776
TyrVal: 2.887 ± 1.54
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.962 ± 0.513
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1040 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
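A protein-level bootstrap of this kind might look like the sketch below. It is illustrative only: it assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies, neither of which is specified on this page. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide given in one-letter codes, e.g. 'AA'."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples (assumed error measure)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not the proteins of this proteome.
proteins = ["MAAAT", "MKLLV"]
print(pair_frequency(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

With only 2 proteins in the proteome, each resample can take very few distinct compositions, so the resulting error estimates are coarse and should be read with caution.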

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski