Amino acid dipepetide frequency for Hubei picorna-like virus 24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.481AlaAla: 4.481 ± 0.269
2.037AlaCys: 2.037 ± 0.327
1.222AlaAsp: 1.222 ± 0.055
3.259AlaGlu: 3.259 ± 0.324
2.444AlaPhe: 2.444 ± 1.302
4.073AlaGly: 4.073 ± 0.654
0.407AlaHis: 0.407 ± 0.489
4.073AlaIle: 4.073 ± 0.758
4.073AlaLys: 4.073 ± 0.052
5.295AlaLeu: 5.295 ± 0.709
2.037AlaMet: 2.037 ± 1.739
4.073AlaAsn: 4.073 ± 0.758
2.444AlaPro: 2.444 ± 0.11
1.222AlaGln: 1.222 ± 0.055
2.444AlaArg: 2.444 ± 0.11
4.888AlaSer: 4.888 ± 0.926
2.851AlaThr: 2.851 ± 1.305
5.295AlaVal: 5.295 ± 2.121
1.629AlaTrp: 1.629 ± 0.868
0.815AlaTyr: 0.815 ± 0.434
0.0AlaXaa: 0.0 ± 0.0
Cys
1.222CysAla: 1.222 ± 0.761
0.0CysCys: 0.0 ± 0.0
1.222CysAsp: 1.222 ± 0.055
0.815CysGlu: 0.815 ± 0.434
0.815CysPhe: 0.815 ± 0.434
1.222CysGly: 1.222 ± 0.651
0.407CysHis: 0.407 ± 0.217
1.222CysIle: 1.222 ± 0.055
0.407CysLys: 0.407 ± 0.217
1.222CysLeu: 1.222 ± 0.651
0.815CysMet: 0.815 ± 0.434
0.0CysAsn: 0.0 ± 0.0
0.815CysPro: 0.815 ± 0.434
0.815CysGln: 0.815 ± 0.434
0.407CysArg: 0.407 ± 0.217
0.0CysSer: 0.0 ± 0.0
0.407CysThr: 0.407 ± 0.489
1.222CysVal: 1.222 ± 0.761
0.0CysTrp: 0.0 ± 0.0
1.222CysTyr: 1.222 ± 0.761
0.0CysXaa: 0.0 ± 0.0
Asp
3.259AspAla: 3.259 ± 1.03
0.407AspCys: 0.407 ± 0.217
3.259AspAsp: 3.259 ± 0.324
4.073AspGlu: 4.073 ± 0.758
2.851AspPhe: 2.851 ± 0.599
4.073AspGly: 4.073 ± 2.771
0.815AspHis: 0.815 ± 0.272
4.073AspIle: 4.073 ± 0.052
2.851AspLys: 2.851 ± 0.813
5.703AspLeu: 5.703 ± 1.198
0.407AspMet: 0.407 ± 0.217
2.037AspAsn: 2.037 ± 1.085
2.851AspPro: 2.851 ± 0.599
1.629AspGln: 1.629 ± 0.544
3.666AspArg: 3.666 ± 0.541
4.073AspSer: 4.073 ± 2.169
1.629AspThr: 1.629 ± 0.544
5.295AspVal: 5.295 ± 0.709
2.444AspTrp: 2.444 ± 0.816
2.851AspTyr: 2.851 ± 0.107
0.0AspXaa: 0.0 ± 0.0
Glu
4.073GluAla: 4.073 ± 0.052
2.037GluCys: 2.037 ± 0.327
4.073GluAsp: 4.073 ± 0.654
3.666GluGlu: 3.666 ± 0.541
2.444GluPhe: 2.444 ± 0.596
4.481GluGly: 4.481 ± 0.974
0.407GluHis: 0.407 ± 0.217
3.666GluIle: 3.666 ± 0.541
4.073GluLys: 4.073 ± 0.052
4.888GluLeu: 4.888 ± 1.191
2.037GluMet: 2.037 ± 1.033
3.259GluAsn: 3.259 ± 1.03
0.815GluPro: 0.815 ± 0.434
1.222GluGln: 1.222 ± 0.651
2.851GluArg: 2.851 ± 1.518
2.037GluSer: 2.037 ± 1.085
3.259GluThr: 3.259 ± 1.03
4.481GluVal: 4.481 ± 1.849
1.629GluTrp: 1.629 ± 0.868
1.222GluTyr: 1.222 ± 0.055
0.0GluXaa: 0.0 ± 0.0
Phe
1.222PheAla: 1.222 ± 0.761
0.407PheCys: 0.407 ± 0.217
6.517PheAsp: 6.517 ± 0.647
4.481PheGlu: 4.481 ± 0.269
0.407PhePhe: 0.407 ± 0.217
1.629PheGly: 1.629 ± 0.162
1.222PheHis: 1.222 ± 0.761
2.037PheIle: 2.037 ± 0.327
3.666PheLys: 3.666 ± 1.952
4.888PheLeu: 4.888 ± 0.926
1.222PheMet: 1.222 ± 0.651
3.666PheAsn: 3.666 ± 1.246
1.629PhePro: 1.629 ± 0.868
1.222PheGln: 1.222 ± 0.055
2.037PheArg: 2.037 ± 0.379
3.666PheSer: 3.666 ± 0.541
2.037PheThr: 2.037 ± 0.379
2.851PheVal: 2.851 ± 0.107
0.815PheTrp: 0.815 ± 0.434
1.629PheTyr: 1.629 ± 0.868
0.0PheXaa: 0.0 ± 0.0
Gly
3.259GlyAla: 3.259 ± 1.088
0.407GlyCys: 0.407 ± 0.489
5.295GlyAsp: 5.295 ± 0.709
2.851GlyGlu: 2.851 ± 0.599
2.851GlyPhe: 2.851 ± 0.107
6.517GlyGly: 6.517 ± 7.116
1.629GlyHis: 1.629 ± 0.868
3.666GlyIle: 3.666 ± 0.165
4.888GlyLys: 4.888 ± 0.926
5.295GlyLeu: 5.295 ± 2.121
0.407GlyMet: 0.407 ± 0.217
2.444GlyAsn: 2.444 ± 0.596
1.629GlyPro: 1.629 ± 0.544
0.815GlyGln: 0.815 ± 0.272
4.073GlyArg: 4.073 ± 1.36
3.259GlySer: 3.259 ± 0.324
2.037GlyThr: 2.037 ± 0.327
4.073GlyVal: 4.073 ± 0.052
0.407GlyTrp: 0.407 ± 0.489
4.481GlyTyr: 4.481 ± 0.269
0.0GlyXaa: 0.0 ± 0.0
His
1.629HisAla: 1.629 ± 0.162
0.407HisCys: 0.407 ± 0.217
0.815HisAsp: 0.815 ± 0.434
0.407HisGlu: 0.407 ± 0.217
0.815HisPhe: 0.815 ± 0.272
1.222HisGly: 1.222 ± 0.651
0.407HisHis: 0.407 ± 0.217
2.037HisIle: 2.037 ± 1.085
1.629HisLys: 1.629 ± 0.868
0.407HisLeu: 0.407 ± 0.217
0.407HisMet: 0.407 ± 0.217
0.407HisAsn: 0.407 ± 0.217
2.037HisPro: 2.037 ± 0.327
0.407HisGln: 0.407 ± 0.217
0.407HisArg: 0.407 ± 0.217
2.444HisSer: 2.444 ± 0.816
0.407HisThr: 0.407 ± 0.489
0.815HisVal: 0.815 ± 0.272
0.407HisTrp: 0.407 ± 0.217
2.037HisTyr: 2.037 ± 1.085
0.0HisXaa: 0.0 ± 0.0
Ile
5.295IleAla: 5.295 ± 2.114
1.629IleCys: 1.629 ± 0.868
5.295IleAsp: 5.295 ± 0.003
2.851IleGlu: 2.851 ± 0.599
3.259IlePhe: 3.259 ± 1.03
4.073IleGly: 4.073 ± 0.758
1.629IleHis: 1.629 ± 0.162
3.259IleIle: 3.259 ± 1.03
3.259IleLys: 3.259 ± 1.03
5.703IleLeu: 5.703 ± 1.904
1.629IleMet: 1.629 ± 0.868
3.666IleAsn: 3.666 ± 0.871
4.073IlePro: 4.073 ± 2.066
2.037IleGln: 2.037 ± 0.379
3.666IleArg: 3.666 ± 1.246
6.925IleSer: 6.925 ± 0.547
2.851IleThr: 2.851 ± 2.716
5.703IleVal: 5.703 ± 1.625
0.0IleTrp: 0.0 ± 0.0
2.851IleTyr: 2.851 ± 0.813
0.0IleXaa: 0.0 ± 0.0
Lys
3.666LysAla: 3.666 ± 1.577
0.407LysCys: 0.407 ± 0.217
3.259LysAsp: 3.259 ± 1.03
4.888LysGlu: 4.888 ± 1.897
4.888LysPhe: 4.888 ± 1.897
4.073LysGly: 4.073 ± 2.066
1.222LysHis: 1.222 ± 0.651
6.517LysIle: 6.517 ± 1.353
2.851LysLys: 2.851 ± 0.813
4.888LysLeu: 4.888 ± 0.22
3.259LysMet: 3.259 ± 1.276
2.037LysAsn: 2.037 ± 0.379
3.259LysPro: 3.259 ± 0.382
5.295LysGln: 5.295 ± 1.415
3.666LysArg: 3.666 ± 0.541
4.481LysSer: 4.481 ± 1.68
2.444LysThr: 2.444 ± 0.11
4.481LysVal: 4.481 ± 0.269
0.815LysTrp: 0.815 ± 0.434
2.037LysTyr: 2.037 ± 1.085
0.0LysXaa: 0.0 ± 0.0
Leu
6.11LeuAla: 6.11 ± 0.275
0.407LeuCys: 0.407 ± 0.489
5.703LeuAsp: 5.703 ± 0.214
6.11LeuGlu: 6.11 ± 1.842
3.666LeuPhe: 3.666 ± 0.541
5.703LeuGly: 5.703 ± 0.214
1.629LeuHis: 1.629 ± 0.868
5.703LeuIle: 5.703 ± 1.625
4.888LeuLys: 4.888 ± 0.486
6.517LeuLeu: 6.517 ± 1.353
1.222LeuMet: 1.222 ± 0.761
4.481LeuAsn: 4.481 ± 1.143
6.11LeuPro: 6.11 ± 1.687
2.851LeuGln: 2.851 ± 0.813
2.851LeuArg: 2.851 ± 0.599
6.517LeuSer: 6.517 ± 0.647
4.888LeuThr: 4.888 ± 0.926
8.554LeuVal: 8.554 ± 1.797
2.037LeuTrp: 2.037 ± 0.327
2.037LeuTyr: 2.037 ± 0.327
0.0LeuXaa: 0.0 ± 0.0
Met
1.629MetAla: 1.629 ± 0.868
0.0MetCys: 0.0 ± 0.0
2.037MetAsp: 2.037 ± 1.033
1.222MetGlu: 1.222 ± 0.055
1.629MetPhe: 1.629 ± 0.868
1.222MetGly: 1.222 ± 0.761
0.0MetHis: 0.0 ± 0.0
2.851MetIle: 2.851 ± 0.813
0.815MetLys: 0.815 ± 0.272
1.222MetLeu: 1.222 ± 0.651
0.0MetMet: 0.0 ± 0.184
1.222MetAsn: 1.222 ± 0.055
1.629MetPro: 1.629 ± 0.544
1.222MetGln: 1.222 ± 0.055
2.037MetArg: 2.037 ± 0.327
1.629MetSer: 1.629 ± 1.25
0.815MetThr: 0.815 ± 0.434
1.222MetVal: 1.222 ± 0.651
0.407MetTrp: 0.407 ± 0.217
1.629MetTyr: 1.629 ± 0.868
0.0MetXaa: 0.0 ± 0.0
Asn
3.259AsnAla: 3.259 ± 1.794
0.407AsnCys: 0.407 ± 0.217
1.629AsnAsp: 1.629 ± 0.868
1.629AsnGlu: 1.629 ± 0.162
0.815AsnPhe: 0.815 ± 0.434
2.851AsnGly: 2.851 ± 0.107
1.629AsnHis: 1.629 ± 0.162
2.851AsnIle: 2.851 ± 1.518
2.851AsnLys: 2.851 ± 0.813
6.11AsnLeu: 6.11 ± 1.136
2.037AsnMet: 2.037 ± 0.327
0.0AsnAsn: 0.0 ± 0.0
4.073AsnPro: 4.073 ± 0.758
1.629AsnGln: 1.629 ± 0.868
0.407AsnArg: 0.407 ± 0.489
4.888AsnSer: 4.888 ± 0.486
1.222AsnThr: 1.222 ± 0.055
3.259AsnVal: 3.259 ± 0.382
1.222AsnTrp: 1.222 ± 0.055
2.037AsnTyr: 2.037 ± 0.327
0.0AsnXaa: 0.0 ± 0.0
Pro
2.444ProAla: 2.444 ± 0.596
0.815ProCys: 0.815 ± 0.434
2.037ProAsp: 2.037 ± 0.327
2.444ProGlu: 2.444 ± 0.596
3.259ProPhe: 3.259 ± 1.794
1.629ProGly: 1.629 ± 0.544
0.407ProHis: 0.407 ± 0.217
2.444ProIle: 2.444 ± 0.596
4.481ProLys: 4.481 ± 0.269
5.703ProLeu: 5.703 ± 1.198
0.815ProMet: 0.815 ± 0.272
3.259ProAsn: 3.259 ± 0.382
2.444ProPro: 2.444 ± 0.596
2.444ProGln: 2.444 ± 0.816
2.851ProArg: 2.851 ± 0.599
2.037ProSer: 2.037 ± 0.379
3.666ProThr: 3.666 ± 1.577
4.481ProVal: 4.481 ± 1.849
0.0ProTrp: 0.0 ± 0.0
3.259ProTyr: 3.259 ± 0.382
0.0ProXaa: 0.0 ± 0.0
Gln
2.037GlnAla: 2.037 ± 1.739
0.815GlnCys: 0.815 ± 0.434
0.407GlnAsp: 0.407 ± 0.489
2.037GlnGlu: 2.037 ± 0.379
2.444GlnPhe: 2.444 ± 0.596
1.629GlnGly: 1.629 ± 0.544
1.629GlnHis: 1.629 ± 0.162
1.222GlnIle: 1.222 ± 0.651
1.629GlnLys: 1.629 ± 0.162
3.666GlnLeu: 3.666 ± 0.541
1.629GlnMet: 1.629 ± 0.162
0.407GlnAsn: 0.407 ± 0.217
2.037GlnPro: 2.037 ± 0.327
4.481GlnGln: 4.481 ± 0.437
0.407GlnArg: 0.407 ± 0.217
2.851GlnSer: 2.851 ± 0.107
2.037GlnThr: 2.037 ± 0.379
2.037GlnVal: 2.037 ± 0.327
0.407GlnTrp: 0.407 ± 0.489
0.815GlnTyr: 0.815 ± 0.272
0.0GlnXaa: 0.0 ± 0.0
Arg
2.444ArgAla: 2.444 ± 0.816
0.815ArgCys: 0.815 ± 0.434
3.259ArgAsp: 3.259 ± 1.03
2.037ArgGlu: 2.037 ± 0.379
2.444ArgPhe: 2.444 ± 0.11
2.444ArgGly: 2.444 ± 0.11
0.815ArgHis: 0.815 ± 0.434
5.295ArgIle: 5.295 ± 1.415
3.666ArgLys: 3.666 ± 1.246
4.073ArgLeu: 4.073 ± 0.052
0.815ArgMet: 0.815 ± 0.272
3.259ArgAsn: 3.259 ± 1.735
2.444ArgPro: 2.444 ± 0.11
1.629ArgGln: 1.629 ± 0.162
3.666ArgArg: 3.666 ± 1.246
2.444ArgSer: 2.444 ± 0.596
3.259ArgThr: 3.259 ± 0.382
2.444ArgVal: 2.444 ± 0.816
0.815ArgTrp: 0.815 ± 0.434
1.222ArgTyr: 1.222 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
3.666SerAla: 3.666 ± 0.541
1.222SerCys: 1.222 ± 0.055
0.815SerAsp: 0.815 ± 0.272
2.851SerGlu: 2.851 ± 0.107
4.481SerPhe: 4.481 ± 0.437
4.073SerGly: 4.073 ± 0.654
1.222SerHis: 1.222 ± 0.651
8.554SerIle: 8.554 ± 2.503
5.295SerLys: 5.295 ± 0.703
6.11SerLeu: 6.11 ± 1.842
2.037SerMet: 2.037 ± 0.379
3.666SerAsn: 3.666 ± 0.541
3.259SerPro: 3.259 ± 1.794
2.037SerGln: 2.037 ± 0.379
3.666SerArg: 3.666 ± 0.871
5.295SerSer: 5.295 ± 1.408
4.481SerThr: 4.481 ± 1.849
5.703SerVal: 5.703 ± 2.331
0.815SerTrp: 0.815 ± 0.272
2.851SerTyr: 2.851 ± 0.107
0.0SerXaa: 0.0 ± 0.0
Thr
2.444ThrAla: 2.444 ± 1.302
1.222ThrCys: 1.222 ± 0.761
2.851ThrAsp: 2.851 ± 1.305
4.481ThrGlu: 4.481 ± 0.269
1.629ThrPhe: 1.629 ± 0.544
3.259ThrGly: 3.259 ± 3.205
1.629ThrHis: 1.629 ± 0.868
4.073ThrIle: 4.073 ± 0.654
3.259ThrLys: 3.259 ± 1.794
2.444ThrLeu: 2.444 ± 0.596
0.815ThrMet: 0.815 ± 0.434
0.815ThrAsn: 0.815 ± 0.978
2.037ThrPro: 2.037 ± 0.327
1.222ThrGln: 1.222 ± 0.761
4.073ThrArg: 4.073 ± 0.654
4.481ThrSer: 4.481 ± 2.555
2.037ThrThr: 2.037 ± 0.379
2.851ThrVal: 2.851 ± 0.107
0.407ThrTrp: 0.407 ± 0.217
2.037ThrTyr: 2.037 ± 0.327
0.0ThrXaa: 0.0 ± 0.0
Val
4.073ValAla: 4.073 ± 1.36
0.0ValCys: 0.0 ± 0.0
5.295ValAsp: 5.295 ± 0.709
4.073ValGlu: 4.073 ± 0.654
3.666ValPhe: 3.666 ± 0.541
3.666ValGly: 3.666 ± 0.541
1.222ValHis: 1.222 ± 0.761
4.073ValIle: 4.073 ± 0.654
7.739ValLys: 7.739 ± 0.592
8.147ValLeu: 8.147 ± 0.602
1.629ValMet: 1.629 ± 0.868
4.481ValAsn: 4.481 ± 0.437
3.259ValPro: 3.259 ± 0.382
2.037ValGln: 2.037 ± 0.327
3.666ValArg: 3.666 ± 1.246
5.703ValSer: 5.703 ± 1.904
4.073ValThr: 4.073 ± 0.654
4.888ValVal: 4.888 ± 1.632
0.407ValTrp: 0.407 ± 0.217
3.666ValTyr: 3.666 ± 0.871
0.0ValXaa: 0.0 ± 0.0
Trp
1.629TrpAla: 1.629 ± 0.544
0.815TrpCys: 0.815 ± 0.434
0.815TrpAsp: 0.815 ± 0.434
0.815TrpGlu: 0.815 ± 0.272
0.407TrpPhe: 0.407 ± 0.217
0.815TrpGly: 0.815 ± 0.434
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
2.851TrpLys: 2.851 ± 0.107
1.222TrpLeu: 1.222 ± 0.651
0.0TrpMet: 0.0 ± 0.0
0.407TrpAsn: 0.407 ± 0.217
0.815TrpPro: 0.815 ± 0.272
0.407TrpGln: 0.407 ± 0.217
0.407TrpArg: 0.407 ± 0.217
2.037TrpSer: 2.037 ± 1.033
0.815TrpThr: 0.815 ± 0.272
1.222TrpVal: 1.222 ± 0.055
0.815TrpTrp: 0.815 ± 0.272
0.815TrpTyr: 0.815 ± 0.434
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.222TyrAla: 1.222 ± 0.055
0.0TyrCys: 0.0 ± 0.0
2.444TyrAsp: 2.444 ± 0.11
1.629TyrGlu: 1.629 ± 0.868
2.037TyrPhe: 2.037 ± 1.085
1.629TyrGly: 1.629 ± 0.162
1.222TyrHis: 1.222 ± 0.651
2.037TyrIle: 2.037 ± 1.033
3.259TyrLys: 3.259 ± 0.324
4.481TyrLeu: 4.481 ± 0.269
1.222TyrMet: 1.222 ± 0.055
1.222TyrAsn: 1.222 ± 0.055
3.259TyrPro: 3.259 ± 0.324
0.0TyrGln: 0.0 ± 0.0
2.037TyrArg: 2.037 ± 1.085
2.444TyrSer: 2.444 ± 0.11
2.851TyrThr: 2.851 ± 0.599
4.888TyrVal: 4.888 ± 0.486
1.629TyrTrp: 1.629 ± 0.544
2.037TyrTyr: 2.037 ± 0.327
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2456 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski