Amino acid dipeptide frequency for Hubei sobemo-like virus 49

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
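The counts and the 0.25% expectation follow from simple enumeration. The short Python sketch below lists every ordered residue pair and derives the uniform expectation. The names STANDARD and dipeptides are illustrative choices, not part of Proteome-pI.

```python
from itertools import product

# Three-letter codes of the 20 standard amino acids, in the table's column order.
STANDARD = ["Ala", "Cys", "Asp", "Glu", "Phe", "Gly", "His", "Ile", "Lys", "Leu",
            "Met", "Asn", "Pro", "Gln", "Arg", "Ser", "Thr", "Val", "Trp", "Tyr"]


def dipeptides(alphabet):
    """Return every ordered pair of residues as a concatenated label."""
    return [first + second for first, second in product(alphabet, repeat=2)]


standard_pairs = dipeptides(STANDARD)            # 20 x 20 ordered pairs
extended_pairs = dipeptides(STANDARD + ["Xaa"])  # 21 x 21 ordered pairs

# Uniform expectation: every standard dipeptide is equally likely.
expected_fraction = 1 / len(standard_pairs)
expected_percent = expected_fraction * 100
expected_per_mille = expected_fraction * 1000

print(len(standard_pairs), len(extended_pairs))
print(expected_percent, expected_per_mille)
```

Order matters here: AlaCys and CysAla are counted as different dipeptides, which is why the table is not symmetric.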

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
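As an illustration of how such values could be obtained, the following Python sketch counts overlapping dipeptides and reports them in per mille. It is a minimal example under stated assumptions, not the database's actual pipeline: pairs are counted within each protein only, any letter outside the 20 standard one-letter codes is treated as Xaa (X), the comparison threshold is the uniform expectation of 2.5‰ (0.25%), and all function names are hypothetical.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(sequences):
    """Count overlapping dipeptides within each sequence; return per mille frequencies."""
    counts = Counter()
    for seq in sequences:
        # Assumption: every non-standard or unknown residue is mapped to X (Xaa).
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(per_mille, expected=2.5):
    """Label a frequency relative to the uniform expectation (2.5 per mille = 0.25%)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


# Toy sequences made up for illustration; they are not the viral proteins.
freqs = dipeptide_per_mille(["MAVVLS", "AVKX"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With real proteins the same dictionary could be used to fill the 21 x 21 grid, with pairs that never occur reported as 0.0.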

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.764 ± 2.172
AlaCys: 2.509 ± 1.448
AlaAsp: 0.0 ± 0.0
AlaGlu: 3.764 ± 0.487
AlaPhe: 1.255 ± 0.724
AlaGly: 2.509 ± 1.448
AlaHis: 0.0 ± 0.0
AlaIle: 5.019 ± 1.211
AlaLys: 5.019 ± 1.211
AlaLeu: 5.019 ± 1.211
AlaMet: 0.0 ± 0.0
AlaAsn: 2.509 ± 1.448
AlaPro: 5.019 ± 1.211
AlaGln: 3.764 ± 2.172
AlaArg: 5.019 ± 0.474
AlaSer: 5.019 ± 0.474
AlaThr: 2.509 ± 0.237
AlaVal: 8.783 ± 1.699
AlaTrp: 2.509 ± 1.922
AlaTyr: 2.509 ± 0.237
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.255 ± 0.724
CysCys: 1.255 ± 0.961
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 2.509 ± 1.448
CysGly: 2.509 ± 1.922
CysHis: 0.0 ± 0.0
CysIle: 1.255 ± 0.961
CysLys: 1.255 ± 0.724
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 1.255 ± 0.961
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 2.509 ± 0.237
CysTrp: 0.0 ± 0.0
CysTyr: 1.255 ± 0.724
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.019 ± 1.211
AspCys: 0.0 ± 0.0
AspAsp: 3.764 ± 0.487
AspGlu: 5.019 ± 2.158
AspPhe: 1.255 ± 0.724
AspGly: 5.019 ± 0.474
AspHis: 1.255 ± 0.724
AspIle: 2.509 ± 1.448
AspLys: 5.019 ± 2.158
AspLeu: 7.528 ± 2.395
AspMet: 3.764 ± 0.487
AspAsn: 2.509 ± 1.922
AspPro: 2.509 ± 0.237
AspGln: 0.0 ± 0.0
AspArg: 1.255 ± 0.724
AspSer: 6.274 ± 0.25
AspThr: 2.509 ± 0.237
AspVal: 5.019 ± 1.211
AspTrp: 2.509 ± 1.922
AspTyr: 3.764 ± 1.198
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.274 ± 0.25
GluCys: 0.0 ± 0.0
GluAsp: 1.255 ± 0.961
GluGlu: 1.255 ± 0.961
GluPhe: 5.019 ± 2.158
GluGly: 1.255 ± 0.961
GluHis: 1.255 ± 0.724
GluIle: 5.019 ± 2.158
GluLys: 1.255 ± 0.724
GluLeu: 2.509 ± 1.922
GluMet: 2.509 ± 1.448
GluAsn: 1.255 ± 0.961
GluPro: 2.509 ± 0.237
GluGln: 0.0 ± 0.0
GluArg: 3.764 ± 0.487
GluSer: 6.274 ± 1.935
GluThr: 2.509 ± 0.237
GluVal: 1.255 ± 0.724
GluTrp: 2.509 ± 0.237
GluTyr: 3.764 ± 2.172
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.509 ± 1.448
PheCys: 0.0 ± 0.0
PheAsp: 2.509 ± 0.237
PheGlu: 2.509 ± 1.922
PhePhe: 0.0 ± 0.0
PheGly: 5.019 ± 1.211
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 1.255 ± 0.724
PheLeu: 1.255 ± 0.724
PheMet: 0.0 ± 0.0
PheAsn: 1.255 ± 0.724
PhePro: 0.0 ± 0.0
PheGln: 0.0 ± 0.0
PheArg: 1.255 ± 0.724
PheSer: 1.255 ± 0.724
PheThr: 0.0 ± 0.0
PheVal: 6.274 ± 1.434
PheTrp: 0.0 ± 0.0
PheTyr: 1.255 ± 0.961
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.019 ± 2.896
GlyCys: 2.509 ± 0.237
GlyAsp: 5.019 ± 0.474
GlyGlu: 3.764 ± 2.172
GlyPhe: 1.255 ± 0.724
GlyGly: 7.528 ± 2.659
GlyHis: 1.255 ± 0.961
GlyIle: 6.274 ± 1.434
GlyLys: 1.255 ± 0.724
GlyLeu: 5.019 ± 0.474
GlyMet: 0.0 ± 0.0
GlyAsn: 1.255 ± 0.724
GlyPro: 6.274 ± 1.935
GlyGln: 2.509 ± 1.448
GlyArg: 5.019 ± 1.211
GlySer: 3.764 ± 2.172
GlyThr: 7.528 ± 2.659
GlyVal: 3.764 ± 1.198
GlyTrp: 1.255 ± 0.961
GlyTyr: 6.274 ± 0.25
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.255 ± 0.961
HisAsp: 2.509 ± 0.237
HisGlu: 1.255 ± 0.961
HisPhe: 2.509 ± 1.448
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.255 ± 0.961
HisLeu: 2.509 ± 1.922
HisMet: 1.255 ± 0.961
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 2.509 ± 1.448
HisArg: 2.509 ± 0.237
HisSer: 1.255 ± 0.961
HisThr: 0.0 ± 0.0
HisVal: 2.509 ± 0.237
HisTrp: 1.255 ± 0.724
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.255 ± 0.961
IleCys: 1.255 ± 0.724
IleAsp: 3.764 ± 2.882
IleGlu: 6.274 ± 0.25
IlePhe: 0.0 ± 0.0
IleGly: 2.509 ± 1.448
IleHis: 1.255 ± 0.724
IleIle: 0.0 ± 0.0
IleLys: 0.0 ± 0.0
IleLeu: 6.274 ± 3.119
IleMet: 2.509 ± 0.237
IleAsn: 1.255 ± 0.724
IlePro: 2.509 ± 0.237
IleGln: 2.509 ± 1.448
IleArg: 1.255 ± 0.724
IleSer: 2.509 ± 0.237
IleThr: 0.0 ± 0.0
IleVal: 3.764 ± 0.487
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.255 ± 0.724
LysCys: 1.255 ± 0.724
LysAsp: 2.509 ± 1.448
LysGlu: 0.0 ± 0.0
LysPhe: 1.255 ± 0.724
LysGly: 5.019 ± 1.211
LysHis: 2.509 ± 0.237
LysIle: 1.255 ± 0.724
LysLys: 1.255 ± 0.961
LysLeu: 3.764 ± 0.487
LysMet: 2.509 ± 1.922
LysAsn: 0.0 ± 0.0
LysPro: 0.0 ± 0.0
LysGln: 5.019 ± 2.158
LysArg: 5.019 ± 1.211
LysSer: 5.019 ± 3.843
LysThr: 0.0 ± 0.0
LysVal: 6.274 ± 0.25
LysTrp: 2.509 ± 1.922
LysTyr: 1.255 ± 0.724
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.764 ± 1.198
LeuCys: 0.0 ± 0.0
LeuAsp: 5.019 ± 2.158
LeuGlu: 6.274 ± 1.434
LeuPhe: 1.255 ± 0.961
LeuGly: 8.783 ± 1.699
LeuHis: 0.0 ± 0.0
LeuIle: 3.764 ± 1.198
LeuLys: 6.274 ± 1.434
LeuLeu: 5.019 ± 1.211
LeuMet: 1.255 ± 0.724
LeuAsn: 1.255 ± 0.961
LeuPro: 5.019 ± 3.843
LeuGln: 1.255 ± 0.961
LeuArg: 3.764 ± 1.198
LeuSer: 12.547 ± 2.186
LeuThr: 0.0 ± 0.0
LeuVal: 12.547 ± 2.186
LeuTrp: 2.509 ± 1.922
LeuTyr: 1.255 ± 0.961
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.764 ± 0.487
MetCys: 0.0 ± 0.0
MetAsp: 6.274 ± 0.25
MetGlu: 0.0 ± 0.0
MetPhe: 1.255 ± 0.961
MetGly: 5.019 ± 2.896
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.255 ± 0.961
MetLeu: 3.764 ± 1.198
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 1.255 ± 0.961
MetArg: 2.509 ± 0.237
MetSer: 5.019 ± 1.211
MetThr: 2.509 ± 0.237
MetVal: 1.255 ± 0.724
MetTrp: 0.0 ± 0.0
MetTyr: 3.764 ± 2.882
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.509 ± 0.237
AsnCys: 0.0 ± 0.0
AsnAsp: 2.509 ± 1.922
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.255 ± 0.724
AsnGly: 2.509 ± 1.448
AsnHis: 0.0 ± 0.0
AsnIle: 1.255 ± 0.961
AsnLys: 1.255 ± 0.724
AsnLeu: 5.019 ± 1.211
AsnMet: 0.0 ± 0.0
AsnAsn: 3.764 ± 0.487
AsnPro: 0.0 ± 0.0
AsnGln: 1.255 ± 0.724
AsnArg: 0.0 ± 0.0
AsnSer: 6.274 ± 1.434
AsnThr: 2.509 ± 0.237
AsnVal: 1.255 ± 0.961
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 0.0 ± 0.0
ProAsp: 3.764 ± 2.172
ProGlu: 1.255 ± 0.724
ProPhe: 0.0 ± 0.0
ProGly: 3.764 ± 1.198
ProHis: 1.255 ± 0.961
ProIle: 1.255 ± 0.724
ProLys: 3.764 ± 1.198
ProLeu: 2.509 ± 0.237
ProMet: 6.274 ± 1.434
ProAsn: 0.0 ± 0.0
ProPro: 1.255 ± 0.724
ProGln: 2.509 ± 0.237
ProArg: 3.764 ± 2.172
ProSer: 3.764 ± 0.487
ProThr: 1.255 ± 0.724
ProVal: 3.764 ± 1.198
ProTrp: 0.0 ± 0.0
ProTyr: 1.255 ± 0.961
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.764 ± 0.487
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.255 ± 0.961
GlnPhe: 0.0 ± 0.0
GlnGly: 3.764 ± 0.487
GlnHis: 0.0 ± 0.0
GlnIle: 1.255 ± 0.724
GlnLys: 3.764 ± 1.198
GlnLeu: 6.274 ± 0.25
GlnMet: 1.255 ± 0.724
GlnAsn: 1.255 ± 0.724
GlnPro: 1.255 ± 0.961
GlnGln: 2.509 ± 1.922
GlnArg: 5.019 ± 0.474
GlnSer: 2.509 ± 1.448
GlnThr: 3.764 ± 2.172
GlnVal: 2.509 ± 1.922
GlnTrp: 0.0 ± 0.0
GlnTyr: 5.019 ± 2.158
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.0 ± 0.0
ArgCys: 1.255 ± 0.724
ArgAsp: 6.274 ± 3.62
ArgGlu: 3.764 ± 1.198
ArgPhe: 3.764 ± 0.487
ArgGly: 3.764 ± 2.882
ArgHis: 2.509 ± 1.448
ArgIle: 2.509 ± 0.237
ArgLys: 2.509 ± 0.237
ArgLeu: 6.274 ± 1.434
ArgMet: 3.764 ± 1.198
ArgAsn: 1.255 ± 0.724
ArgPro: 1.255 ± 0.724
ArgGln: 3.764 ± 0.487
ArgArg: 8.783 ± 1.671
ArgSer: 1.255 ± 0.724
ArgThr: 5.019 ± 2.896
ArgVal: 2.509 ± 0.237
ArgTrp: 2.509 ± 0.237
ArgTyr: 2.509 ± 1.448
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.038 ± 2.423
SerCys: 0.0 ± 0.0
SerAsp: 8.783 ± 1.671
SerGlu: 1.255 ± 0.724
SerPhe: 0.0 ± 0.0
SerGly: 6.274 ± 1.935
SerHis: 1.255 ± 0.961
SerIle: 1.255 ± 0.724
SerLys: 0.0 ± 0.0
SerLeu: 3.764 ± 1.198
SerMet: 3.764 ± 1.631
SerAsn: 3.764 ± 2.172
SerPro: 3.764 ± 0.487
SerGln: 7.528 ± 0.975
SerArg: 5.019 ± 1.211
SerSer: 7.528 ± 0.975
SerThr: 3.764 ± 0.487
SerVal: 5.019 ± 2.896
SerTrp: 3.764 ± 0.487
SerTyr: 2.509 ± 1.922
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.509 ± 0.237
ThrCys: 0.0 ± 0.0
ThrAsp: 2.509 ± 0.237
ThrGlu: 1.255 ± 0.724
ThrPhe: 0.0 ± 0.0
ThrGly: 5.019 ± 2.896
ThrHis: 2.509 ± 0.237
ThrIle: 2.509 ± 0.237
ThrLys: 2.509 ± 1.448
ThrLeu: 3.764 ± 1.198
ThrMet: 3.764 ± 2.172
ThrAsn: 3.764 ± 1.198
ThrPro: 3.764 ± 2.172
ThrGln: 0.0 ± 0.0
ThrArg: 1.255 ± 0.724
ThrSer: 1.255 ± 0.961
ThrThr: 5.019 ± 0.474
ThrVal: 3.764 ± 0.487
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.509 ± 0.237
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.038 ± 2.423
ValCys: 3.764 ± 2.882
ValAsp: 3.764 ± 0.487
ValGlu: 3.764 ± 0.487
ValPhe: 3.764 ± 1.198
ValGly: 6.274 ± 1.935
ValHis: 3.764 ± 1.198
ValIle: 1.255 ± 0.724
ValLys: 6.274 ± 1.434
ValLeu: 6.274 ± 1.935
ValMet: 2.509 ± 1.448
ValAsn: 5.019 ± 2.158
ValPro: 3.764 ± 0.487
ValGln: 3.764 ± 2.882
ValArg: 2.509 ± 1.448
ValSer: 3.764 ± 2.172
ValThr: 1.255 ± 0.724
ValVal: 11.292 ± 0.223
ValTrp: 1.255 ± 0.961
ValTyr: 5.019 ± 1.211
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.255 ± 0.961
TrpCys: 0.0 ± 0.0
TrpAsp: 2.509 ± 1.922
TrpGlu: 1.255 ± 0.961
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 3.764 ± 2.882
TrpIle: 0.0 ± 0.0
TrpLys: 1.255 ± 0.724
TrpLeu: 2.509 ± 0.237
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 3.764 ± 1.198
TrpSer: 0.0 ± 0.0
TrpThr: 1.255 ± 0.961
TrpVal: 3.764 ± 0.487
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.255 ± 0.961
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.509 ± 1.448
TyrCys: 0.0 ± 0.0
TyrAsp: 3.764 ± 1.198
TyrGlu: 7.528 ± 2.659
TyrPhe: 0.0 ± 0.0
TyrGly: 0.0 ± 0.0
TyrHis: 0.0 ± 0.0
TyrIle: 2.509 ± 0.237
TyrLys: 1.255 ± 0.724
TyrLeu: 2.509 ± 1.922
TyrMet: 1.255 ± 0.961
TyrAsn: 1.255 ± 0.961
TyrPro: 2.509 ± 1.922
TyrGln: 3.764 ± 1.198
TyrArg: 3.764 ± 1.198
TyrSer: 3.764 ± 2.172
TyrThr: 6.274 ± 1.434
TyrVal: 1.255 ± 0.961
TyrTrp: 1.255 ± 0.961
TyrTyr: 2.509 ± 1.922
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (798 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
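A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under assumptions, not the procedure Proteome-pI actually runs: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the population standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Per mille frequency of one dipeptide (one-letter codes) over a set of sequences."""
    count = total = 0
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                count += 1
    return 1000 * count / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Spread of the dipeptide frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome made up for illustration; it is not the viral proteome.
toy = ["MAVVLS", "AVKAA"]
print(bootstrap_error(toy, "VV"))  # VV occurs in only one protein, so replicates differ
print(bootstrap_error(toy, "WW"))  # WW never occurs, so every replicate is zero
```

With only two proteins each replicate can take very few distinct values, so error estimates of this kind are coarse, and a dipeptide that is absent from every protein necessarily has zero error.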

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski