Amino acid dipepetide frequency for Hubei picorna-like virus 76

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.113AlaAla: 5.113 ± 2.147
1.805AlaCys: 1.805 ± 0.927
3.609AlaAsp: 3.609 ± 0.164
1.504AlaGlu: 1.504 ± 0.285
2.707AlaPhe: 2.707 ± 1.181
3.609AlaGly: 3.609 ± 2.686
1.805AlaHis: 1.805 ± 0.684
4.812AlaIle: 4.812 ± 1.222
3.609AlaLys: 3.609 ± 0.757
7.218AlaLeu: 7.218 ± 0.328
2.406AlaMet: 2.406 ± 1.266
3.609AlaAsn: 3.609 ± 2.064
3.008AlaPro: 3.008 ± 1.424
4.511AlaGln: 4.511 ± 1.531
1.805AlaArg: 1.805 ± 0.544
4.211AlaSer: 4.211 ± 1.728
4.211AlaThr: 4.211 ± 3.024
3.91AlaVal: 3.91 ± 1.144
0.902AlaTrp: 0.902 ± 0.306
0.902AlaTyr: 0.902 ± 0.669
0.0AlaXaa: 0.0 ± 0.0
Cys
1.805CysAla: 1.805 ± 0.383
0.301CysCys: 0.301 ± 0.155
0.902CysAsp: 0.902 ± 0.464
0.602CysGlu: 0.602 ± 0.309
1.203CysPhe: 1.203 ± 0.618
1.504CysGly: 1.504 ± 0.773
0.902CysHis: 0.902 ± 0.464
0.602CysIle: 0.602 ± 0.309
1.504CysLys: 1.504 ± 0.773
1.203CysLeu: 1.203 ± 0.618
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.602CysPro: 0.602 ± 0.309
1.203CysGln: 1.203 ± 0.638
0.902CysArg: 0.902 ± 0.464
0.902CysSer: 0.902 ± 0.306
2.707CysThr: 2.707 ± 0.958
2.406CysVal: 2.406 ± 0.505
0.0CysTrp: 0.0 ± 0.0
0.602CysTyr: 0.602 ± 0.309
0.0CysXaa: 0.0 ± 0.0
Asp
2.707AspAla: 2.707 ± 0.516
1.504AspCys: 1.504 ± 0.773
4.211AspAsp: 4.211 ± 1.546
2.707AspGlu: 2.707 ± 1.538
3.91AspPhe: 3.91 ± 1.144
3.008AspGly: 3.008 ± 0.84
1.504AspHis: 1.504 ± 0.773
1.504AspIle: 1.504 ± 0.773
2.105AspLys: 2.105 ± 1.082
5.414AspLeu: 5.414 ± 1.838
0.902AspMet: 0.902 ± 0.464
1.504AspAsn: 1.504 ± 1.369
3.308AspPro: 3.308 ± 0.657
3.308AspGln: 3.308 ± 0.657
2.105AspArg: 2.105 ± 1.082
4.211AspSer: 4.211 ± 1.624
3.609AspThr: 3.609 ± 0.164
3.91AspVal: 3.91 ± 1.906
0.602AspTrp: 0.602 ± 0.414
2.406AspTyr: 2.406 ± 0.848
0.0AspXaa: 0.0 ± 0.0
Glu
2.707GluAla: 2.707 ± 0.793
0.301GluCys: 0.301 ± 0.155
2.406GluAsp: 2.406 ± 1.236
4.511GluGlu: 4.511 ± 2.318
3.609GluPhe: 3.609 ± 1.854
0.301GluGly: 0.301 ± 0.155
2.105GluHis: 2.105 ± 0.539
3.609GluIle: 3.609 ± 0.164
5.113GluLys: 5.113 ± 1.37
6.617GluLeu: 6.617 ± 0.409
1.504GluMet: 1.504 ± 0.773
3.008GluAsn: 3.008 ± 0.427
3.609GluPro: 3.609 ± 0.766
2.406GluGln: 2.406 ± 1.276
1.805GluArg: 1.805 ± 0.927
3.308GluSer: 3.308 ± 1.091
1.203GluThr: 1.203 ± 0.252
4.211GluVal: 4.211 ± 0.794
1.203GluTrp: 1.203 ± 0.618
3.609GluTyr: 3.609 ± 0.766
0.0GluXaa: 0.0 ± 0.0
Phe
2.105PheAla: 2.105 ± 0.539
1.805PheCys: 1.805 ± 0.383
4.812PheAsp: 4.812 ± 1.526
5.113PheGlu: 5.113 ± 0.689
3.008PhePhe: 3.008 ± 1.051
3.008PheGly: 3.008 ± 0.427
1.504PheHis: 1.504 ± 0.712
2.707PheIle: 2.707 ± 0.392
5.714PheLys: 5.714 ± 2.312
1.805PheLeu: 1.805 ± 0.613
1.203PheMet: 1.203 ± 0.565
1.504PheAsn: 1.504 ± 0.285
1.203PhePro: 1.203 ± 0.638
2.105PheGln: 2.105 ± 0.448
2.105PheArg: 2.105 ± 0.51
5.414PheSer: 5.414 ± 0.226
2.406PheThr: 2.406 ± 1.657
3.609PheVal: 3.609 ± 0.757
0.902PheTrp: 0.902 ± 0.306
3.91PheTyr: 3.91 ± 2.009
0.0PheXaa: 0.0 ± 0.0
Gly
3.609GlyAla: 3.609 ± 0.744
0.602GlyCys: 0.602 ± 0.309
2.105GlyAsp: 2.105 ± 0.755
4.211GlyGlu: 4.211 ± 1.079
3.308GlyPhe: 3.308 ± 1.951
3.008GlyGly: 3.008 ± 2.403
0.902GlyHis: 0.902 ± 0.306
3.308GlyIle: 3.308 ± 1.091
5.414GlyLys: 5.414 ± 0.841
3.308GlyLeu: 3.308 ± 2.73
1.504GlyMet: 1.504 ± 0.285
1.504GlyAsn: 1.504 ± 1.585
1.504GlyPro: 1.504 ± 0.285
3.008GlyGln: 3.008 ± 1.794
2.406GlyArg: 2.406 ± 0.505
3.609GlySer: 3.609 ± 0.744
3.008GlyThr: 3.008 ± 0.427
3.308GlyVal: 3.308 ± 1.207
0.301GlyTrp: 0.301 ± 0.155
2.105GlyTyr: 2.105 ± 0.51
0.0GlyXaa: 0.0 ± 0.0
His
1.504HisAla: 1.504 ± 0.712
1.504HisCys: 1.504 ± 0.643
1.504HisAsp: 1.504 ± 0.773
1.203HisGlu: 1.203 ± 0.618
1.203HisPhe: 1.203 ± 0.618
1.805HisGly: 1.805 ± 0.383
0.902HisHis: 0.902 ± 0.669
0.902HisIle: 0.902 ± 0.464
1.504HisLys: 1.504 ± 0.285
3.91HisLeu: 3.91 ± 0.592
0.902HisMet: 0.902 ± 0.464
0.902HisAsn: 0.902 ± 0.464
2.707HisPro: 2.707 ± 0.516
0.602HisGln: 0.602 ± 0.733
0.902HisArg: 0.902 ± 0.306
1.805HisSer: 1.805 ± 0.613
0.602HisThr: 0.602 ± 0.309
3.308HisVal: 3.308 ± 0.657
0.602HisTrp: 0.602 ± 0.309
0.602HisTyr: 0.602 ± 0.414
0.0HisXaa: 0.0 ± 0.0
Ile
4.511IleAla: 4.511 ± 1.076
0.301IleCys: 0.301 ± 0.155
4.511IleAsp: 4.511 ± 0.855
2.406IleGlu: 2.406 ± 1.236
3.008IlePhe: 3.008 ± 0.57
3.91IleGly: 3.91 ± 0.795
1.504IleHis: 1.504 ± 1.369
2.707IleIle: 2.707 ± 0.793
3.308IleLys: 3.308 ± 1.207
4.511IleLeu: 4.511 ± 0.855
2.105IleMet: 2.105 ± 0.51
4.211IleAsn: 4.211 ± 0.896
3.609IlePro: 3.609 ± 0.744
2.406IleGln: 2.406 ± 0.505
2.105IleArg: 2.105 ± 0.539
4.511IleSer: 4.511 ± 1.712
3.008IleThr: 3.008 ± 0.941
3.008IleVal: 3.008 ± 1.545
0.602IleTrp: 0.602 ± 0.309
1.805IleTyr: 1.805 ± 0.927
0.0IleXaa: 0.0 ± 0.0
Lys
4.211LysAla: 4.211 ± 0.896
0.602LysCys: 0.602 ± 0.309
3.91LysAsp: 3.91 ± 0.76
5.113LysGlu: 5.113 ± 1.204
4.211LysPhe: 4.211 ± 1.019
2.406LysGly: 2.406 ± 0.649
1.203LysHis: 1.203 ± 0.618
5.414LysIle: 5.414 ± 2.158
7.218LysLys: 7.218 ± 2.484
3.308LysLeu: 3.308 ± 1.091
1.805LysMet: 1.805 ± 0.383
5.113LysAsn: 5.113 ± 1.441
2.406LysPro: 2.406 ± 0.649
2.707LysGln: 2.707 ± 0.793
3.308LysArg: 3.308 ± 1.091
3.008LysSer: 3.008 ± 0.941
4.211LysThr: 4.211 ± 0.794
4.812LysVal: 4.812 ± 2.473
0.902LysTrp: 0.902 ± 0.669
2.105LysTyr: 2.105 ± 0.539
0.0LysXaa: 0.0 ± 0.0
Leu
3.91LeuAla: 3.91 ± 1.356
2.105LeuCys: 2.105 ± 1.082
4.511LeuAsp: 4.511 ± 0.827
4.511LeuGlu: 4.511 ± 1.076
3.91LeuPhe: 3.91 ± 0.145
3.308LeuGly: 3.308 ± 0.284
3.308LeuHis: 3.308 ± 0.657
4.511LeuIle: 4.511 ± 1.072
5.714LeuLys: 5.714 ± 2.96
7.218LeuLeu: 7.218 ± 2.176
1.504LeuMet: 1.504 ± 0.773
5.113LeuAsn: 5.113 ± 1.37
4.511LeuPro: 4.511 ± 1.077
3.91LeuGln: 3.91 ± 2.72
4.211LeuArg: 4.211 ± 1.079
6.316LeuSer: 6.316 ± 1.344
4.812LeuThr: 4.812 ± 2.787
4.211LeuVal: 4.211 ± 1.758
1.504LeuTrp: 1.504 ± 0.712
2.406LeuTyr: 2.406 ± 0.649
0.0LeuXaa: 0.0 ± 0.0
Met
1.504MetAla: 1.504 ± 0.662
0.301MetCys: 0.301 ± 0.155
0.902MetAsp: 0.902 ± 0.306
0.902MetGlu: 0.902 ± 0.464
0.902MetPhe: 0.902 ± 0.464
0.902MetGly: 0.902 ± 0.669
0.902MetHis: 0.902 ± 0.464
0.301MetIle: 0.301 ± 0.155
0.602MetLys: 0.602 ± 0.733
3.008MetLeu: 3.008 ± 0.449
0.301MetMet: 0.301 ± 0.155
0.902MetAsn: 0.902 ± 0.464
1.504MetPro: 1.504 ± 0.662
1.504MetGln: 1.504 ± 0.285
0.301MetArg: 0.301 ± 0.155
2.707MetSer: 2.707 ± 1.391
1.805MetThr: 1.805 ± 0.544
0.902MetVal: 0.902 ± 0.464
0.602MetTrp: 0.602 ± 0.309
1.805MetTyr: 1.805 ± 1.243
0.0MetXaa: 0.0 ± 0.0
Asn
3.308AsnAla: 3.308 ± 3.868
1.805AsnCys: 1.805 ± 0.383
0.902AsnAsp: 0.902 ± 0.464
2.707AsnGlu: 2.707 ± 0.392
6.617AsnPhe: 6.617 ± 1.569
2.105AsnGly: 2.105 ± 0.51
1.203AsnHis: 1.203 ± 0.618
1.504AsnIle: 1.504 ± 0.662
2.406AsnLys: 2.406 ± 1.236
6.015AsnLeu: 6.015 ± 4.193
1.504AsnMet: 1.504 ± 0.527
2.105AsnAsn: 2.105 ± 1.512
3.609AsnPro: 3.609 ± 0.757
1.504AsnGln: 1.504 ± 1.585
1.203AsnArg: 1.203 ± 0.618
4.211AsnSer: 4.211 ± 1.912
2.707AsnThr: 2.707 ± 2.525
2.105AsnVal: 2.105 ± 0.755
0.301AsnTrp: 0.301 ± 0.545
2.105AsnTyr: 2.105 ± 0.51
0.0AsnXaa: 0.0 ± 0.0
Pro
2.105ProAla: 2.105 ± 0.539
0.301ProCys: 0.301 ± 0.155
4.211ProAsp: 4.211 ± 1.079
3.609ProGlu: 3.609 ± 1.242
1.805ProPhe: 1.805 ± 0.684
3.308ProGly: 3.308 ± 0.284
0.602ProHis: 0.602 ± 0.309
5.113ProIle: 5.113 ± 1.009
2.707ProLys: 2.707 ± 0.392
5.113ProLeu: 5.113 ± 1.913
0.902ProMet: 0.902 ± 0.464
2.406ProAsn: 2.406 ± 0.649
2.406ProPro: 2.406 ± 1.015
1.504ProGln: 1.504 ± 0.712
0.902ProArg: 0.902 ± 0.464
4.211ProSer: 4.211 ± 2.499
4.511ProThr: 4.511 ± 1.072
4.211ProVal: 4.211 ± 1.546
0.301ProTrp: 0.301 ± 0.155
2.406ProTyr: 2.406 ± 0.649
0.0ProXaa: 0.0 ± 0.0
Gln
3.609GlnAla: 3.609 ± 0.164
0.0GlnCys: 0.0 ± 0.0
1.203GlnAsp: 1.203 ± 0.829
3.91GlnGlu: 3.91 ± 0.795
1.504GlnPhe: 1.504 ± 1.186
2.406GlnGly: 2.406 ± 1.657
2.105GlnHis: 2.105 ± 0.448
2.707GlnIle: 2.707 ± 0.392
3.008GlnLys: 3.008 ± 0.941
2.707GlnLeu: 2.707 ± 0.392
0.902GlnMet: 0.902 ± 0.356
3.609GlnAsn: 3.609 ± 2.014
3.609GlnPro: 3.609 ± 1.088
2.406GlnGln: 2.406 ± 3.827
2.707GlnArg: 2.707 ± 2.086
3.008GlnSer: 3.008 ± 0.449
1.805GlnThr: 1.805 ± 0.613
2.707GlnVal: 2.707 ± 0.919
0.301GlnTrp: 0.301 ± 0.155
1.504GlnTyr: 1.504 ± 0.643
0.0GlnXaa: 0.0 ± 0.0
Arg
2.105ArgAla: 2.105 ± 0.51
0.902ArgCys: 0.902 ± 0.464
1.805ArgAsp: 1.805 ± 0.383
1.504ArgGlu: 1.504 ± 0.773
3.609ArgPhe: 3.609 ± 1.242
1.805ArgGly: 1.805 ± 0.927
1.203ArgHis: 1.203 ± 0.618
2.707ArgIle: 2.707 ± 0.793
3.008ArgLys: 3.008 ± 0.941
3.91ArgLeu: 3.91 ± 0.888
0.0ArgMet: 0.0 ± 0.0
3.308ArgAsn: 3.308 ± 2.517
1.203ArgPro: 1.203 ± 0.252
2.707ArgGln: 2.707 ± 0.958
2.105ArgArg: 2.105 ± 0.51
3.609ArgSer: 3.609 ± 2.106
1.805ArgThr: 1.805 ± 0.927
1.504ArgVal: 1.504 ± 0.712
0.0ArgTrp: 0.0 ± 0.0
0.602ArgTyr: 0.602 ± 0.309
0.0ArgXaa: 0.0 ± 0.0
Ser
7.82SerAla: 7.82 ± 4.075
1.805SerCys: 1.805 ± 0.544
3.308SerAsp: 3.308 ± 0.657
4.511SerGlu: 4.511 ± 1.156
3.308SerPhe: 3.308 ± 0.284
5.113SerGly: 5.113 ± 1.556
2.105SerHis: 2.105 ± 1.082
3.91SerIle: 3.91 ± 1.274
4.211SerLys: 4.211 ± 1.546
4.211SerLeu: 4.211 ± 1.546
1.504SerMet: 1.504 ± 1.395
3.308SerAsn: 3.308 ± 1.913
3.91SerPro: 3.91 ± 0.795
3.008SerGln: 3.008 ± 1.078
3.91SerArg: 3.91 ± 0.592
5.414SerSer: 5.414 ± 1.759
5.414SerThr: 5.414 ± 3.3
5.113SerVal: 5.113 ± 1.93
0.602SerTrp: 0.602 ± 0.733
3.008SerTyr: 3.008 ± 0.57
0.0SerXaa: 0.0 ± 0.0
Thr
3.91ThrAla: 3.91 ± 0.984
1.203ThrCys: 1.203 ± 0.618
3.008ThrAsp: 3.008 ± 1.424
1.504ThrGlu: 1.504 ± 0.285
3.308ThrPhe: 3.308 ± 0.897
3.91ThrGly: 3.91 ± 0.984
1.504ThrHis: 1.504 ± 0.712
2.707ThrIle: 2.707 ± 0.392
3.008ThrLys: 3.008 ± 0.941
6.316ThrLeu: 6.316 ± 3.44
0.902ThrMet: 0.902 ± 0.956
2.707ThrAsn: 2.707 ± 1.181
4.812ThrPro: 4.812 ± 1.297
2.406ThrGln: 2.406 ± 1.266
2.707ThrArg: 2.707 ± 1.391
6.316ThrSer: 6.316 ± 2.073
5.113ThrThr: 5.113 ± 3.597
3.008ThrVal: 3.008 ± 1.424
0.602ThrTrp: 0.602 ± 0.414
2.105ThrTyr: 2.105 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
3.91ValAla: 3.91 ± 1.274
2.105ValCys: 2.105 ± 0.51
3.008ValAsp: 3.008 ± 0.57
2.707ValGlu: 2.707 ± 0.793
3.308ValPhe: 3.308 ± 0.785
3.91ValGly: 3.91 ± 1.356
2.105ValHis: 2.105 ± 0.448
5.414ValIle: 5.414 ± 1.031
4.211ValLys: 4.211 ± 2.164
3.308ValLeu: 3.308 ± 0.785
1.504ValMet: 1.504 ± 0.773
3.008ValAsn: 3.008 ± 0.427
3.008ValPro: 3.008 ± 0.84
2.406ValGln: 2.406 ± 0.505
2.105ValArg: 2.105 ± 0.539
4.812ValSer: 4.812 ± 1.297
4.511ValThr: 4.511 ± 1.076
3.91ValVal: 3.91 ± 0.888
0.902ValTrp: 0.902 ± 0.669
2.406ValTyr: 2.406 ± 0.649
0.0ValXaa: 0.0 ± 0.0
Trp
0.902TrpAla: 0.902 ± 0.956
0.0TrpCys: 0.0 ± 0.0
0.602TrpAsp: 0.602 ± 1.091
0.602TrpGlu: 0.602 ± 0.309
0.602TrpPhe: 0.602 ± 0.309
0.301TrpGly: 0.301 ± 0.155
0.301TrpHis: 0.301 ± 0.155
1.504TrpIle: 1.504 ± 0.773
1.504TrpLys: 1.504 ± 0.285
0.602TrpLeu: 0.602 ± 1.642
0.301TrpMet: 0.301 ± 0.155
0.0TrpAsn: 0.0 ± 0.0
0.301TrpPro: 0.301 ± 0.155
0.0TrpGln: 0.0 ± 0.0
0.301TrpArg: 0.301 ± 0.155
1.805TrpSer: 1.805 ± 0.544
1.805TrpThr: 1.805 ± 0.383
0.301TrpVal: 0.301 ± 0.155
0.602TrpTrp: 0.602 ± 0.309
0.301TrpTyr: 0.301 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.91TyrAla: 3.91 ± 0.888
0.602TyrCys: 0.602 ± 0.309
3.008TyrAsp: 3.008 ± 0.941
3.308TyrGlu: 3.308 ± 1.091
1.203TyrPhe: 1.203 ± 0.252
2.406TyrGly: 2.406 ± 0.649
1.203TyrHis: 1.203 ± 0.252
2.105TyrIle: 2.105 ± 1.124
2.105TyrLys: 2.105 ± 0.51
1.805TyrLeu: 1.805 ± 0.383
0.602TyrMet: 0.602 ± 0.733
2.105TyrAsn: 2.105 ± 0.448
1.805TyrPro: 1.805 ± 0.927
2.105TyrGln: 2.105 ± 0.539
1.504TyrArg: 1.504 ± 0.773
2.105TyrSer: 2.105 ± 0.51
1.805TyrThr: 1.805 ± 0.684
2.105TyrVal: 2.105 ± 0.51
0.902TyrTrp: 0.902 ± 0.464
0.301TyrTyr: 0.301 ± 0.155
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3326 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski