Amino acid dipeptide frequency for Hubei sobemo-like virus 33

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred uniformly at random in proteins, each of the 400 would therefore be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
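The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names, the one-letter alphabet with X standing in for Xaa, and the choice to count overlapping pairs only within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumption)
UNIFORM_PER_MILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X.
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs within a single protein.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(ALPHABET, repeat=2)}


def representation(freq_per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

For example, `representation(8.307)` returns `"over"` and `representation(0.0)` returns `"under"`, which mirrors the red and blue marking mentioned above under the assumption that the uniform 0.25% value is the reference.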

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
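As a small worked illustration of this unit conversion, the following sketch turns a tabulated per mille value into a plain fraction and into a percentage. The helper names are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


# LeuLeu is listed as 13.499 per mille in the table.
leu_leu = 13.499
print(f"{per_mille_to_fraction(leu_leu):.6f}")   # fraction of all dipeptides
print(f"{per_mille_to_percent(leu_leu):.4f} %")  # same value as a percentage
```

The uniform expectation of 0.25% corresponds to 2.5‰ on this scale, which is the number to compare the table entries against.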

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.307 ± 0.221
AlaCys: 1.038 ± 0.929
AlaAsp: 4.154 ± 0.876
AlaGlu: 1.038 ± 0.602
AlaPhe: 4.154 ± 0.876
AlaGly: 5.192 ± 1.478
AlaHis: 2.077 ± 1.204
AlaIle: 6.231 ± 3.611
AlaLys: 3.115 ± 1.806
AlaLeu: 8.307 ± 2.842
AlaMet: 2.077 ± 1.204
AlaAsn: 0.0 ± 0.0
AlaPro: 2.077 ± 1.204
AlaGln: 2.077 ± 0.328
AlaArg: 2.077 ± 0.328
AlaSer: 4.154 ± 0.876
AlaThr: 3.115 ± 1.806
AlaVal: 3.115 ± 1.806
AlaTrp: 4.154 ± 0.655
AlaTyr: 4.154 ± 2.186
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.038 ± 0.602
CysAsp: 2.077 ± 1.859
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 2.077 ± 0.328
CysHis: 1.038 ± 0.929
CysIle: 1.038 ± 0.929
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.038 ± 0.602
CysSer: 1.038 ± 0.929
CysThr: 1.038 ± 0.602
CysVal: 2.077 ± 0.328
CysTrp: 1.038 ± 0.929
CysTyr: 2.077 ± 0.328
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.192 ± 1.478
AspCys: 0.0 ± 0.0
AspAsp: 2.077 ± 1.859
AspGlu: 5.192 ± 3.009
AspPhe: 3.115 ± 0.274
AspGly: 3.115 ± 1.257
AspHis: 0.0 ± 0.0
AspIle: 1.038 ± 0.929
AspLys: 2.077 ± 0.328
AspLeu: 6.231 ± 2.514
AspMet: 0.0 ± 0.0
AspAsn: 2.077 ± 1.859
AspPro: 1.038 ± 0.929
AspGln: 1.038 ± 0.929
AspArg: 1.038 ± 0.929
AspSer: 2.077 ± 0.328
AspThr: 4.154 ± 0.876
AspVal: 4.154 ± 0.655
AspTrp: 2.077 ± 1.859
AspTyr: 3.115 ± 0.274
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.192 ± 3.009
GluCys: 2.077 ± 1.859
GluAsp: 0.0 ± 0.0
GluGlu: 7.269 ± 1.15
GluPhe: 3.115 ± 1.257
GluGly: 2.077 ± 1.204
GluHis: 0.0 ± 0.0
GluIle: 3.115 ± 1.257
GluLys: 7.269 ± 4.213
GluLeu: 1.038 ± 0.929
GluMet: 5.192 ± 1.478
GluAsn: 0.0 ± 0.0
GluPro: 3.115 ± 1.257
GluGln: 1.038 ± 0.602
GluArg: 6.231 ± 4.045
GluSer: 5.192 ± 1.478
GluThr: 2.077 ± 0.328
GluVal: 2.077 ± 1.859
GluTrp: 0.0 ± 0.0
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.038 ± 0.929
PheCys: 1.038 ± 0.602
PheAsp: 2.077 ± 1.204
PheGlu: 3.115 ± 1.257
PhePhe: 2.077 ± 0.328
PheGly: 3.115 ± 0.274
PheHis: 0.0 ± 0.0
PheIle: 3.115 ± 0.274
PheLys: 0.0 ± 0.0
PheLeu: 1.038 ± 0.929
PheMet: 1.038 ± 0.602
PheAsn: 1.038 ± 0.602
PhePro: 3.115 ± 0.274
PheGln: 1.038 ± 0.602
PheArg: 1.038 ± 0.602
PheSer: 1.038 ± 0.929
PheThr: 2.077 ± 0.328
PheVal: 5.192 ± 1.478
PheTrp: 1.038 ± 0.602
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.115 ± 1.806
GlyCys: 1.038 ± 0.929
GlyAsp: 4.154 ± 0.655
GlyGlu: 6.231 ± 0.983
GlyPhe: 0.0 ± 0.0
GlyGly: 2.077 ± 1.859
GlyHis: 2.077 ± 0.328
GlyIle: 5.192 ± 3.009
GlyLys: 5.192 ± 3.116
GlyLeu: 6.231 ± 2.08
GlyMet: 2.077 ± 1.204
GlyAsn: 1.038 ± 0.929
GlyPro: 1.038 ± 0.602
GlyGln: 2.077 ± 1.204
GlyArg: 3.115 ± 1.806
GlySer: 4.154 ± 0.876
GlyThr: 5.192 ± 1.478
GlyVal: 5.192 ± 1.478
GlyTrp: 5.192 ± 0.053
GlyTyr: 5.192 ± 0.053
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.038 ± 0.929
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.038 ± 0.929
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.038 ± 0.929
HisLys: 1.038 ± 0.929
HisLeu: 1.038 ± 0.602
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 1.038 ± 0.602
HisArg: 2.077 ± 1.859
HisSer: 2.077 ± 0.328
HisThr: 2.077 ± 0.328
HisVal: 5.192 ± 0.053
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.192 ± 0.053
IleCys: 0.0 ± 0.0
IleAsp: 4.154 ± 3.718
IleGlu: 6.231 ± 0.983
IlePhe: 1.038 ± 0.602
IleGly: 5.192 ± 3.009
IleHis: 1.038 ± 0.929
IleIle: 2.077 ± 1.859
IleLys: 3.115 ± 0.274
IleLeu: 2.077 ± 1.859
IleMet: 4.154 ± 0.655
IleAsn: 1.038 ± 0.602
IlePro: 5.192 ± 1.478
IleGln: 1.038 ± 0.929
IleArg: 5.192 ± 0.053
IleSer: 2.077 ± 0.328
IleThr: 3.115 ± 1.806
IleVal: 2.077 ± 0.328
IleTrp: 0.0 ± 0.0
IleTyr: 3.115 ± 0.274
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.154 ± 2.186
LysCys: 1.038 ± 0.602
LysAsp: 2.077 ± 0.328
LysGlu: 0.0 ± 0.0
LysPhe: 0.0 ± 0.0
LysGly: 5.192 ± 0.053
LysHis: 2.077 ± 0.328
LysIle: 3.115 ± 0.274
LysLys: 5.192 ± 1.478
LysLeu: 2.077 ± 0.328
LysMet: 1.038 ± 0.929
LysAsn: 0.0 ± 0.0
LysPro: 10.384 ± 2.956
LysGln: 3.115 ± 0.274
LysArg: 1.038 ± 0.602
LysSer: 3.115 ± 1.257
LysThr: 1.038 ± 0.602
LysVal: 5.192 ± 1.478
LysTrp: 2.077 ± 1.204
LysTyr: 2.077 ± 0.328
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.115 ± 0.274
LeuCys: 1.038 ± 0.929
LeuAsp: 10.384 ± 3.169
LeuGlu: 6.231 ± 0.983
LeuPhe: 3.115 ± 2.788
LeuGly: 4.154 ± 2.407
LeuHis: 1.038 ± 0.929
LeuIle: 5.192 ± 0.053
LeuLys: 0.0 ± 0.0
LeuLeu: 13.499 ± 4.426
LeuMet: 3.115 ± 0.274
LeuAsn: 4.154 ± 0.655
LeuPro: 3.115 ± 0.274
LeuGln: 6.231 ± 0.983
LeuArg: 6.231 ± 2.08
LeuSer: 3.115 ± 0.274
LeuThr: 5.192 ± 3.009
LeuVal: 8.307 ± 4.815
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.115 ± 2.788
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.115 ± 1.806
MetCys: 0.0 ± 0.0
MetAsp: 3.115 ± 0.274
MetGlu: 1.038 ± 0.602
MetPhe: 1.038 ± 0.602
MetGly: 2.077 ± 1.859
MetHis: 1.038 ± 0.602
MetIle: 3.115 ± 0.274
MetLys: 3.115 ± 1.806
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 3.115 ± 2.788
MetPro: 3.115 ± 1.257
MetGln: 3.115 ± 0.274
MetArg: 3.115 ± 0.274
MetSer: 2.077 ± 0.328
MetThr: 2.077 ± 1.204
MetVal: 1.038 ± 0.602
MetTrp: 0.0 ± 0.0
MetTyr: 1.038 ± 0.602
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.115 ± 1.257
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.0 ± 0.0
AsnGly: 1.038 ± 0.929
AsnHis: 1.038 ± 0.602
AsnIle: 2.077 ± 0.328
AsnLys: 1.038 ± 0.602
AsnLeu: 1.038 ± 0.602
AsnMet: 2.077 ± 1.859
AsnAsn: 0.0 ± 0.0
AsnPro: 1.038 ± 0.929
AsnGln: 2.077 ± 0.328
AsnArg: 2.077 ± 0.328
AsnSer: 6.231 ± 0.983
AsnThr: 2.077 ± 0.328
AsnVal: 2.077 ± 1.859
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.038 ± 0.929
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.115 ± 0.274
ProCys: 1.038 ± 0.602
ProAsp: 3.115 ± 1.257
ProGlu: 2.077 ± 0.328
ProPhe: 1.038 ± 0.602
ProGly: 7.269 ± 1.912
ProHis: 1.038 ± 0.929
ProIle: 2.077 ± 0.328
ProLys: 4.154 ± 0.876
ProLeu: 6.231 ± 0.549
ProMet: 1.038 ± 0.602
ProAsn: 1.038 ± 0.602
ProPro: 0.0 ± 0.0
ProGln: 0.0 ± 0.0
ProArg: 0.0 ± 0.0
ProSer: 8.307 ± 1.752
ProThr: 3.115 ± 1.806
ProVal: 2.077 ± 1.204
ProTrp: 1.038 ± 0.929
ProTyr: 2.077 ± 0.328
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 1.038 ± 0.929
GlnAsp: 1.038 ± 0.929
GlnGlu: 1.038 ± 0.602
GlnPhe: 0.0 ± 0.0
GlnGly: 0.0 ± 0.0
GlnHis: 1.038 ± 0.929
GlnIle: 1.038 ± 0.929
GlnLys: 3.115 ± 0.274
GlnLeu: 4.154 ± 0.655
GlnMet: 0.0 ± 0.0
GlnAsn: 2.077 ± 1.204
GlnPro: 2.077 ± 0.328
GlnGln: 3.115 ± 0.274
GlnArg: 6.231 ± 0.983
GlnSer: 4.154 ± 0.876
GlnThr: 3.115 ± 0.274
GlnVal: 3.115 ± 1.806
GlnTrp: 1.038 ± 0.602
GlnTyr: 1.038 ± 0.929
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.115 ± 0.274
ArgCys: 0.0 ± 0.0
ArgAsp: 0.0 ± 0.0
ArgGlu: 2.077 ± 0.328
ArgPhe: 2.077 ± 0.328
ArgGly: 2.077 ± 1.204
ArgHis: 0.0 ± 0.0
ArgIle: 2.077 ± 0.328
ArgLys: 2.077 ± 1.204
ArgLeu: 10.384 ± 0.107
ArgMet: 5.192 ± 1.585
ArgAsn: 1.038 ± 0.602
ArgPro: 1.038 ± 0.602
ArgGln: 3.115 ± 1.257
ArgArg: 3.115 ± 1.806
ArgSer: 2.077 ± 0.328
ArgThr: 4.154 ± 0.876
ArgVal: 6.231 ± 0.983
ArgTrp: 3.115 ± 2.788
ArgTyr: 3.115 ± 1.257
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.231 ± 0.549
SerCys: 2.077 ± 0.328
SerAsp: 2.077 ± 1.204
SerGlu: 5.192 ± 0.053
SerPhe: 0.0 ± 0.0
SerGly: 7.269 ± 1.15
SerHis: 0.0 ± 0.0
SerIle: 1.038 ± 0.929
SerLys: 3.115 ± 1.257
SerLeu: 6.231 ± 2.08
SerMet: 5.192 ± 1.478
SerAsn: 1.038 ± 0.929
SerPro: 7.269 ± 0.381
SerGln: 3.115 ± 0.274
SerArg: 5.192 ± 0.053
SerSer: 6.231 ± 0.549
SerThr: 7.269 ± 4.213
SerVal: 6.231 ± 5.576
SerTrp: 1.038 ± 0.929
SerTyr: 3.115 ± 0.274
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.307 ± 3.284
ThrCys: 0.0 ± 0.0
ThrAsp: 2.077 ± 1.204
ThrGlu: 2.077 ± 0.328
ThrPhe: 6.231 ± 3.611
ThrGly: 4.154 ± 2.407
ThrHis: 2.077 ± 0.328
ThrIle: 4.154 ± 0.655
ThrLys: 0.0 ± 0.0
ThrLeu: 8.307 ± 1.752
ThrMet: 0.0 ± 0.0
ThrAsn: 2.077 ± 0.328
ThrPro: 1.038 ± 0.602
ThrGln: 3.115 ± 0.274
ThrArg: 1.038 ± 0.602
ThrSer: 8.307 ± 0.221
ThrThr: 4.154 ± 0.655
ThrVal: 5.192 ± 3.009
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.154 ± 2.407
ValCys: 2.077 ± 0.328
ValAsp: 2.077 ± 1.204
ValGlu: 5.192 ± 1.585
ValPhe: 3.115 ± 1.806
ValGly: 8.307 ± 1.752
ValHis: 1.038 ± 0.929
ValIle: 4.154 ± 0.655
ValLys: 6.231 ± 0.983
ValLeu: 7.269 ± 2.682
ValMet: 1.038 ± 0.164
ValAsn: 3.115 ± 1.257
ValPro: 4.154 ± 0.876
ValGln: 2.077 ± 0.328
ValArg: 3.115 ± 0.274
ValSer: 7.269 ± 0.381
ValThr: 2.077 ± 0.328
ValVal: 5.192 ± 0.053
ValTrp: 1.038 ± 0.929
ValTyr: 3.115 ± 1.806
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.038 ± 0.602
TrpCys: 0.0 ± 0.0
TrpAsp: 2.077 ± 0.328
TrpGlu: 1.038 ± 0.602
TrpPhe: 1.038 ± 0.929
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 2.077 ± 1.859
TrpLys: 2.077 ± 1.859
TrpLeu: 3.115 ± 0.274
TrpMet: 2.077 ± 1.204
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 2.077 ± 1.859
TrpSer: 3.115 ± 1.257
TrpThr: 2.077 ± 1.859
TrpVal: 1.038 ± 0.602
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.077 ± 0.328
TyrCys: 1.038 ± 0.929
TyrAsp: 2.077 ± 0.328
TyrGlu: 1.038 ± 0.602
TyrPhe: 1.038 ± 0.602
TyrGly: 5.192 ± 0.053
TyrHis: 1.038 ± 0.929
TyrIle: 4.154 ± 0.876
TyrLys: 2.077 ± 0.328
TyrLeu: 2.077 ± 0.328
TyrMet: 0.0 ± 0.0
TyrAsn: 5.192 ± 3.116
TyrPro: 1.038 ± 0.929
TyrGln: 0.0 ± 0.0
TyrArg: 1.038 ± 0.929
TyrSer: 3.115 ± 1.806
TyrThr: 3.115 ± 0.274
TyrVal: 2.077 ± 1.859
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.038 ± 0.602
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (964 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
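A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of 100 resamples, and the spread of those estimates is reported as a population standard deviation. The function names, the seed, and the choice of standard deviation as the error measure are hypothetical and are not taken from the database's documentation.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)
```

With only 2 proteins, as in this proteome, each resample is one of just four ordered combinations, so the estimated errors take a small number of distinct values and should be read with caution.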

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski