Amino acid dipepetide frequency for Hubei macula-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.139AlaAla: 5.139 ± 1.043
0.514AlaCys: 0.514 ± 0.257
3.597AlaAsp: 3.597 ± 0.76
2.569AlaGlu: 2.569 ± 1.287
2.055AlaPhe: 2.055 ± 1.348
2.055AlaGly: 2.055 ± 0.564
2.569AlaHis: 2.569 ± 1.593
3.597AlaIle: 3.597 ± 1.802
2.569AlaLys: 2.569 ± 1.287
10.791AlaLeu: 10.791 ± 0.352
1.542AlaMet: 1.542 ± 1.479
3.083AlaAsn: 3.083 ± 1.544
6.166AlaPro: 6.166 ± 1.042
3.597AlaGln: 3.597 ± 1.802
5.139AlaArg: 5.139 ± 2.574
8.736AlaSer: 8.736 ± 3.749
5.653AlaThr: 5.653 ± 4.524
4.111AlaVal: 4.111 ± 1.065
1.028AlaTrp: 1.028 ± 0.515
2.055AlaTyr: 2.055 ± 1.03
0.0AlaXaa: 0.0 ± 0.0
Cys
0.514CysAla: 0.514 ± 0.257
0.0CysCys: 0.0 ± 0.0
1.542CysAsp: 1.542 ± 0.704
0.514CysGlu: 0.514 ± 0.257
0.514CysPhe: 0.514 ± 0.257
0.514CysGly: 0.514 ± 0.257
0.514CysHis: 0.514 ± 0.257
0.514CysIle: 0.514 ± 0.257
0.514CysLys: 0.514 ± 0.257
1.028CysLeu: 1.028 ± 0.515
1.028CysMet: 1.028 ± 0.515
0.0CysAsn: 0.0 ± 0.0
0.514CysPro: 0.514 ± 0.257
2.055CysGln: 2.055 ± 1.03
0.514CysArg: 0.514 ± 0.257
3.083CysSer: 3.083 ± 1.495
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.514CysTrp: 0.514 ± 1.118
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.68AspAla: 6.68 ± 2.22
0.0AspCys: 0.0 ± 0.0
3.083AspAsp: 3.083 ± 1.495
2.055AspGlu: 2.055 ± 1.03
4.111AspPhe: 4.111 ± 0.964
2.569AspGly: 2.569 ± 1.287
2.055AspHis: 2.055 ± 0.564
1.542AspIle: 1.542 ± 1.479
1.028AspLys: 1.028 ± 0.898
6.68AspLeu: 6.68 ± 2.163
0.0AspMet: 0.0 ± 0.0
2.569AspAsn: 2.569 ± 1.256
5.139AspPro: 5.139 ± 1.425
1.542AspGln: 1.542 ± 0.704
3.083AspArg: 3.083 ± 1.213
3.083AspSer: 3.083 ± 0.599
0.0AspThr: 0.0 ± 0.0
2.569AspVal: 2.569 ± 2.835
0.514AspTrp: 0.514 ± 0.257
2.569AspTyr: 2.569 ± 1.287
0.0AspXaa: 0.0 ± 0.0
Glu
2.055GluAla: 2.055 ± 0.564
0.0GluCys: 0.0 ± 0.0
1.542GluAsp: 1.542 ± 0.772
1.542GluGlu: 1.542 ± 0.704
2.055GluPhe: 2.055 ± 1.03
1.028GluGly: 1.028 ± 0.515
3.083GluHis: 3.083 ± 1.544
3.083GluIle: 3.083 ± 1.544
1.542GluLys: 1.542 ± 0.772
7.194GluLeu: 7.194 ± 1.521
1.028GluMet: 1.028 ± 0.515
0.514GluAsn: 0.514 ± 0.257
3.083GluPro: 3.083 ± 1.544
2.055GluGln: 2.055 ± 1.03
0.514GluArg: 0.514 ± 0.257
1.542GluSer: 1.542 ± 0.772
3.083GluThr: 3.083 ± 0.599
2.055GluVal: 2.055 ± 1.796
0.514GluTrp: 0.514 ± 0.257
2.055GluTyr: 2.055 ± 1.03
0.0GluXaa: 0.0 ± 0.0
Phe
4.625PheAla: 4.625 ± 2.317
1.542PheCys: 1.542 ± 0.772
3.597PheAsp: 3.597 ± 1.802
3.083PheGlu: 3.083 ± 1.544
3.597PhePhe: 3.597 ± 2.819
4.111PheGly: 4.111 ± 0.964
1.028PheHis: 1.028 ± 0.898
2.055PheIle: 2.055 ± 1.03
0.514PheLys: 0.514 ± 0.257
3.083PheLeu: 3.083 ± 1.544
1.542PheMet: 1.542 ± 0.694
0.514PheAsn: 0.514 ± 1.118
2.055PhePro: 2.055 ± 1.966
3.083PheGln: 3.083 ± 2.578
3.597PheArg: 3.597 ± 0.76
4.111PheSer: 4.111 ± 7.136
4.625PheThr: 4.625 ± 2.317
2.055PheVal: 2.055 ± 0.564
1.028PheTrp: 1.028 ± 0.515
1.028PheTyr: 1.028 ± 0.515
0.0PheXaa: 0.0 ± 0.0
Gly
2.569GlyAla: 2.569 ± 1.727
0.514GlyCys: 0.514 ± 0.257
3.083GlyAsp: 3.083 ± 1.544
2.055GlyGlu: 2.055 ± 0.564
1.028GlyPhe: 1.028 ± 0.515
1.542GlyGly: 1.542 ± 0.772
2.569GlyHis: 2.569 ± 1.593
2.569GlyIle: 2.569 ± 1.727
1.542GlyLys: 1.542 ± 0.772
2.569GlyLeu: 2.569 ± 1.593
0.0GlyMet: 0.0 ± 0.0
2.569GlyAsn: 2.569 ± 0.521
3.597GlyPro: 3.597 ± 1.802
0.0GlyGln: 0.0 ± 0.0
2.569GlyArg: 2.569 ± 1.256
0.514GlySer: 0.514 ± 1.118
0.514GlyThr: 0.514 ± 0.257
4.111GlyVal: 4.111 ± 0.964
1.028GlyTrp: 1.028 ± 0.515
1.028GlyTyr: 1.028 ± 0.515
0.0GlyXaa: 0.0 ± 0.0
His
2.569HisAla: 2.569 ± 1.287
1.028HisCys: 1.028 ± 0.515
2.569HisAsp: 2.569 ± 1.287
2.569HisGlu: 2.569 ± 0.521
1.028HisPhe: 1.028 ± 0.515
1.028HisGly: 1.028 ± 0.515
0.514HisHis: 0.514 ± 0.257
2.055HisIle: 2.055 ± 1.796
3.083HisLys: 3.083 ± 1.544
5.653HisLeu: 5.653 ± 1.668
0.0HisMet: 0.0 ± 0.0
2.055HisAsn: 2.055 ± 1.796
3.083HisPro: 3.083 ± 1.495
1.542HisGln: 1.542 ± 0.772
2.055HisArg: 2.055 ± 1.03
5.139HisSer: 5.139 ± 2.574
1.542HisThr: 1.542 ± 0.772
0.514HisVal: 0.514 ± 0.257
1.028HisTrp: 1.028 ± 0.515
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.111IleAla: 4.111 ± 2.065
0.0IleCys: 0.0 ± 0.0
5.139IleAsp: 5.139 ± 0.751
2.055IleGlu: 2.055 ± 1.03
1.542IlePhe: 1.542 ± 1.479
2.569IleGly: 2.569 ± 0.521
2.569IleHis: 2.569 ± 1.287
3.597IleIle: 3.597 ± 2.321
3.083IleLys: 3.083 ± 1.408
7.194IleLeu: 7.194 ± 1.795
1.542IleMet: 1.542 ± 0.704
3.597IleAsn: 3.597 ± 2.489
4.111IlePro: 4.111 ± 1.127
0.514IleGln: 0.514 ± 0.257
3.083IleArg: 3.083 ± 1.408
2.569IleSer: 2.569 ± 0.521
1.028IleThr: 1.028 ± 0.515
3.597IleVal: 3.597 ± 1.802
0.0IleTrp: 0.0 ± 0.0
4.625IleTyr: 4.625 ± 0.885
0.0IleXaa: 0.0 ± 0.0
Lys
3.597LysAla: 3.597 ± 0.76
0.0LysCys: 0.0 ± 0.0
0.514LysAsp: 0.514 ± 0.257
2.055LysGlu: 2.055 ± 1.03
3.597LysPhe: 3.597 ± 1.224
1.028LysGly: 1.028 ± 0.515
1.028LysHis: 1.028 ± 0.515
4.111LysIle: 4.111 ± 1.127
1.028LysLys: 1.028 ± 0.515
3.083LysLeu: 3.083 ± 0.599
2.055LysMet: 2.055 ± 1.03
3.083LysAsn: 3.083 ± 1.544
2.055LysPro: 2.055 ± 0.564
2.055LysGln: 2.055 ± 0.564
1.542LysArg: 1.542 ± 0.772
3.083LysSer: 3.083 ± 2.578
2.569LysThr: 2.569 ± 1.593
2.569LysVal: 2.569 ± 1.256
1.542LysTrp: 1.542 ± 0.772
0.514LysTyr: 0.514 ± 0.257
0.0LysXaa: 0.0 ± 0.0
Leu
7.194LeuAla: 7.194 ± 2.447
0.514LeuCys: 0.514 ± 1.118
5.139LeuAsp: 5.139 ± 3.454
2.569LeuGlu: 2.569 ± 1.287
6.68LeuPhe: 6.68 ± 2.163
4.625LeuGly: 4.625 ± 1.808
4.625LeuHis: 4.625 ± 2.317
6.166LeuIle: 6.166 ± 5.388
9.25LeuLys: 9.25 ± 3.426
15.93LeuLeu: 15.93 ± 4.401
1.028LeuMet: 1.028 ± 0.515
2.569LeuAsn: 2.569 ± 0.521
12.333LeuPro: 12.333 ± 4.445
4.111LeuGln: 4.111 ± 1.065
4.625LeuArg: 4.625 ± 2.317
14.388LeuSer: 14.388 ± 1.876
6.166LeuThr: 6.166 ± 3.089
4.111LeuVal: 4.111 ± 2.059
0.0LeuTrp: 0.0 ± 0.0
2.055LeuTyr: 2.055 ± 1.03
0.0LeuXaa: 0.0 ± 0.0
Met
2.569MetAla: 2.569 ± 3.114
0.0MetCys: 0.0 ± 0.0
1.028MetAsp: 1.028 ± 2.236
0.514MetGlu: 0.514 ± 0.257
1.028MetPhe: 1.028 ± 0.515
0.0MetGly: 0.0 ± 0.0
2.055MetHis: 2.055 ± 1.03
0.514MetIle: 0.514 ± 0.257
1.028MetLys: 1.028 ± 0.515
2.055MetLeu: 2.055 ± 1.03
0.514MetMet: 0.514 ± 0.257
0.514MetAsn: 0.514 ± 0.257
0.514MetPro: 0.514 ± 0.257
0.514MetGln: 0.514 ± 1.825
1.028MetArg: 1.028 ± 0.515
0.0MetSer: 0.0 ± 0.0
0.514MetThr: 0.514 ± 0.257
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.514MetTyr: 0.514 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
3.597AsnAla: 3.597 ± 0.76
0.0AsnCys: 0.0 ± 0.0
2.055AsnAsp: 2.055 ± 1.03
1.028AsnGlu: 1.028 ± 0.515
4.625AsnPhe: 4.625 ± 1.189
1.028AsnGly: 1.028 ± 0.515
1.028AsnHis: 1.028 ± 0.898
2.055AsnIle: 2.055 ± 1.796
1.542AsnLys: 1.542 ± 0.772
6.166AsnLeu: 6.166 ± 3.089
1.028AsnMet: 1.028 ± 0.702
1.542AsnAsn: 1.542 ± 0.704
5.139AsnPro: 5.139 ± 3.186
3.083AsnGln: 3.083 ± 0.599
1.028AsnArg: 1.028 ± 0.515
2.055AsnSer: 2.055 ± 1.03
1.542AsnThr: 1.542 ± 0.772
3.597AsnVal: 3.597 ± 1.224
0.0AsnTrp: 0.0 ± 0.0
1.028AsnTyr: 1.028 ± 0.515
0.0AsnXaa: 0.0 ± 0.0
Pro
7.194ProAla: 7.194 ± 2.498
3.083ProCys: 3.083 ± 0.599
0.514ProAsp: 0.514 ± 0.257
4.625ProGlu: 4.625 ± 1.189
3.083ProPhe: 3.083 ± 0.599
1.028ProGly: 1.028 ± 0.515
2.055ProHis: 2.055 ± 1.03
4.625ProIle: 4.625 ± 1.808
2.569ProLys: 2.569 ± 0.521
7.194ProLeu: 7.194 ± 1.566
0.514ProMet: 0.514 ± 0.257
3.597ProAsn: 3.597 ± 0.76
6.166ProPro: 6.166 ± 2.817
3.083ProGln: 3.083 ± 1.495
2.569ProArg: 2.569 ± 4.092
11.819ProSer: 11.819 ± 4.231
5.139ProThr: 5.139 ± 1.425
5.139ProVal: 5.139 ± 1.552
0.514ProTrp: 0.514 ± 0.257
3.083ProTyr: 3.083 ± 1.408
0.0ProXaa: 0.0 ± 0.0
Gln
4.625GlnAla: 4.625 ± 1.189
1.028GlnCys: 1.028 ± 0.898
0.0GlnAsp: 0.0 ± 0.0
1.028GlnGlu: 1.028 ± 0.515
3.083GlnPhe: 3.083 ± 1.544
2.055GlnGly: 2.055 ± 0.564
2.055GlnHis: 2.055 ± 1.03
3.597GlnIle: 3.597 ± 1.224
0.514GlnLys: 0.514 ± 0.257
6.68GlnLeu: 6.68 ± 2.163
0.0GlnMet: 0.0 ± 0.0
2.569GlnAsn: 2.569 ± 1.287
2.055GlnPro: 2.055 ± 1.03
1.028GlnGln: 1.028 ± 0.515
3.597GlnArg: 3.597 ± 4.753
3.597GlnSer: 3.597 ± 2.321
2.569GlnThr: 2.569 ± 2.835
0.514GlnVal: 0.514 ± 1.118
0.0GlnTrp: 0.0 ± 0.0
0.514GlnTyr: 0.514 ± 0.257
0.0GlnXaa: 0.0 ± 0.0
Arg
3.083ArgAla: 3.083 ± 1.213
1.028ArgCys: 1.028 ± 0.515
2.569ArgAsp: 2.569 ± 0.521
3.083ArgGlu: 3.083 ± 0.599
3.597ArgPhe: 3.597 ± 1.249
2.569ArgGly: 2.569 ± 2.835
2.055ArgHis: 2.055 ± 1.348
4.111ArgIle: 4.111 ± 2.059
1.542ArgLys: 1.542 ± 5.474
4.625ArgLeu: 4.625 ± 1.055
0.0ArgMet: 0.0 ± 0.0
4.111ArgAsn: 4.111 ± 2.059
1.028ArgPro: 1.028 ± 0.515
2.055ArgGln: 2.055 ± 1.03
2.055ArgArg: 2.055 ± 1.03
6.166ArgSer: 6.166 ± 3.089
2.055ArgThr: 2.055 ± 1.03
2.569ArgVal: 2.569 ± 1.256
1.028ArgTrp: 1.028 ± 0.515
3.597ArgTyr: 3.597 ± 1.224
0.0ArgXaa: 0.0 ± 0.0
Ser
6.68SerAla: 6.68 ± 1.344
2.569SerCys: 2.569 ± 1.256
4.625SerAsp: 4.625 ± 2.889
4.625SerGlu: 4.625 ± 1.055
3.083SerPhe: 3.083 ± 2.959
4.625SerGly: 4.625 ± 1.055
2.569SerHis: 2.569 ± 1.287
6.166SerIle: 6.166 ± 1.691
3.597SerLys: 3.597 ± 1.249
7.708SerLeu: 7.708 ± 2.003
1.028SerMet: 1.028 ± 1.339
6.68SerAsn: 6.68 ± 1.344
6.166SerPro: 6.166 ± 3.992
5.139SerGln: 5.139 ± 2.512
3.597SerArg: 3.597 ± 1.272
10.791SerSer: 10.791 ± 9.56
6.68SerThr: 6.68 ± 2.763
4.625SerVal: 4.625 ± 1.055
1.028SerTrp: 1.028 ± 0.515
4.625SerTyr: 4.625 ± 1.055
0.0SerXaa: 0.0 ± 0.0
Thr
4.111ThrAla: 4.111 ± 0.964
1.542ThrCys: 1.542 ± 0.772
2.569ThrAsp: 2.569 ± 1.256
0.514ThrGlu: 0.514 ± 0.257
0.514ThrPhe: 0.514 ± 0.257
1.028ThrGly: 1.028 ± 0.515
1.542ThrHis: 1.542 ± 0.772
2.055ThrIle: 2.055 ± 0.564
2.055ThrLys: 2.055 ± 0.564
7.194ThrLeu: 7.194 ± 2.544
1.028ThrMet: 1.028 ± 1.641
2.055ThrAsn: 2.055 ± 1.03
5.139ThrPro: 5.139 ± 1.043
2.569ThrGln: 2.569 ± 1.287
4.625ThrArg: 4.625 ± 1.189
3.597ThrSer: 3.597 ± 2.819
1.028ThrThr: 1.028 ± 1.641
4.111ThrVal: 4.111 ± 1.127
0.514ThrTrp: 0.514 ± 0.257
3.083ThrTyr: 3.083 ± 1.544
0.0ThrXaa: 0.0 ± 0.0
Val
3.597ValAla: 3.597 ± 0.76
0.0ValCys: 0.0 ± 0.0
3.597ValAsp: 3.597 ± 1.802
1.542ValGlu: 1.542 ± 0.772
3.597ValPhe: 3.597 ± 2.321
1.542ValGly: 1.542 ± 1.479
2.569ValHis: 2.569 ± 1.287
2.055ValIle: 2.055 ± 1.348
1.542ValLys: 1.542 ± 0.704
4.111ValLeu: 4.111 ± 2.294
0.514ValMet: 0.514 ± 0.257
1.028ValAsn: 1.028 ± 0.515
5.139ValPro: 5.139 ± 2.812
0.514ValGln: 0.514 ± 1.118
5.139ValArg: 5.139 ± 1.54
7.194ValSer: 7.194 ± 1.566
3.083ValThr: 3.083 ± 1.544
6.166ValVal: 6.166 ± 5.156
0.514ValTrp: 0.514 ± 1.825
3.083ValTyr: 3.083 ± 2.959
0.0ValXaa: 0.0 ± 0.0
Trp
1.028TrpAla: 1.028 ± 0.515
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.028TrpGlu: 1.028 ± 0.515
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.514TrpHis: 0.514 ± 0.257
0.514TrpIle: 0.514 ± 0.257
1.028TrpLys: 1.028 ± 0.515
2.055TrpLeu: 2.055 ± 1.03
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.514TrpGln: 0.514 ± 0.257
1.542TrpArg: 1.542 ± 0.772
1.028TrpSer: 1.028 ± 0.898
0.514TrpThr: 0.514 ± 0.257
1.028TrpVal: 1.028 ± 1.641
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.028TyrAla: 1.028 ± 0.515
0.514TyrCys: 0.514 ± 0.257
4.111TyrAsp: 4.111 ± 0.964
0.514TyrGlu: 0.514 ± 0.257
2.055TyrPhe: 2.055 ± 0.564
1.028TyrGly: 1.028 ± 0.515
2.055TyrHis: 2.055 ± 1.03
1.542TyrIle: 1.542 ± 0.772
1.542TyrLys: 1.542 ± 0.704
3.083TyrLeu: 3.083 ± 1.213
0.0TyrMet: 0.0 ± 0.0
1.028TyrAsn: 1.028 ± 0.515
4.111TyrPro: 4.111 ± 1.287
2.055TyrGln: 2.055 ± 0.564
1.028TyrArg: 1.028 ± 0.515
4.111TyrSer: 4.111 ± 1.287
2.569TyrThr: 2.569 ± 0.521
3.083TyrVal: 3.083 ± 1.213
0.0TyrTrp: 0.0 ± 0.0
1.542TyrTyr: 1.542 ± 0.772
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1947 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski