Amino acid dipeptide frequency for Hubei picorna-like virus 67

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
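As a minimal sketch of what a dipeptide frequency is, the Python snippet below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the normalization by the total number of dipeptides are assumptions made for illustration; the exact procedure used to produce the table is not described here.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumption: sequences are one-letter amino acid strings, and
    frequencies are normalized by the total number of dipeptides.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein; pairs never
        # span the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MAAL", "AAK"])
```

In this toy input the pairs are MA, AA, AL, AA and AK, so AA accounts for two of the five dipeptides.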

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
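The expected value under the uniform assumption, and the comparison against it, can be sketched as follows. The function names and the choice of a 20-letter alphabet as the default are assumptions; the page does not state whether the colouring threshold is based on 400 or 441 possible dipeptides.

```python
def expected_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain comparison with the expected value, ignoring
    the reported error estimates.
    """
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


uniform_20 = expected_per_mille(20)  # 400 possible dipeptides
uniform_21 = expected_per_mille(21)  # 441 possible dipeptides (with Xaa)
ala_ala = classify(5.243)            # AlaAla value from the table
ala_cys = classify(0.477)            # AlaCys value from the table
```

With 20 amino acids the expected value is 2.5‰, so AlaAla (5.243‰) falls above it and AlaCys (0.477‰) below it.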

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.243 ± 1.183
AlaCys: 0.477 ± 0.23
AlaAsp: 3.098 ± 1.458
AlaGlu: 1.43 ± 1.859
AlaPhe: 2.622 ± 1.267
AlaGly: 5.005 ± 2.846
AlaHis: 1.668 ± 0.782
AlaIle: 3.337 ± 0.641
AlaLys: 3.575 ± 0.178
AlaLeu: 5.243 ± 2.232
AlaMet: 2.145 ± 1.037
AlaAsn: 3.098 ± 1.498
AlaPro: 2.383 ± 0.615
AlaGln: 3.337 ± 1.259
AlaArg: 4.051 ± 1.946
AlaSer: 3.575 ± 1.467
AlaThr: 5.005 ± 1.153
AlaVal: 2.86 ± 1.383
AlaTrp: 0.953 ± 0.364
AlaTyr: 1.907 ± 0.454
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.192 ± 0.576
CysCys: 0.953 ± 0.461
CysAsp: 1.668 ± 0.385
CysGlu: 2.383 ± 1.152
CysPhe: 0.715 ± 0.346
CysGly: 1.192 ± 0.576
CysHis: 0.477 ± 0.23
CysIle: 0.715 ± 0.346
CysLys: 0.477 ± 0.23
CysLeu: 1.907 ± 0.728
CysMet: 0.477 ± 0.23
CysAsn: 0.715 ± 0.346
CysPro: 1.192 ± 0.576
CysGln: 0.715 ± 1.098
CysArg: 0.953 ± 1.009
CysSer: 1.668 ± 0.385
CysThr: 0.715 ± 0.346
CysVal: 0.953 ± 0.461
CysTrp: 0.238 ± 0.115
CysTyr: 0.715 ± 0.346
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.528 ± 1.613
AspCys: 0.953 ± 0.461
AspAsp: 4.528 ± 1.613
AspGlu: 2.383 ± 0.635
AspPhe: 4.766 ± 1.916
AspGly: 1.907 ± 0.26
AspHis: 0.953 ± 0.364
AspIle: 3.098 ± 1.498
AspLys: 4.29 ± 2.074
AspLeu: 7.865 ± 1.187
AspMet: 0.715 ± 0.424
AspAsn: 2.383 ± 0.635
AspPro: 1.907 ± 0.454
AspGln: 1.43 ± 0.849
AspArg: 4.29 ± 1.064
AspSer: 5.481 ± 0.496
AspThr: 4.29 ± 0.956
AspVal: 3.813 ± 0.204
AspTrp: 0.0 ± 0.0
AspTyr: 3.813 ± 1.843
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.145 ± 0.181
GluCys: 0.715 ± 0.346
GluAsp: 3.813 ± 0.909
GluGlu: 4.051 ± 0.432
GluPhe: 1.907 ± 0.839
GluGly: 1.668 ± 0.564
GluHis: 1.43 ± 1.032
GluIle: 6.196 ± 0.841
GluLys: 3.098 ± 0.398
GluLeu: 5.481 ± 1.706
GluMet: 0.953 ± 0.461
GluAsn: 3.098 ± 1.458
GluPro: 2.622 ± 1.198
GluGln: 1.668 ± 0.782
GluArg: 2.145 ± 1.41
GluSer: 3.813 ± 3.901
GluThr: 2.383 ± 1.303
GluVal: 3.575 ± 1.506
GluTrp: 1.907 ± 0.454
GluTyr: 1.907 ± 0.454
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.907 ± 0.97
PheCys: 0.715 ± 0.346
PheAsp: 3.098 ± 1.689
PheGlu: 2.145 ± 0.726
PhePhe: 1.668 ± 0.782
PheGly: 1.668 ± 0.385
PheHis: 1.668 ± 0.782
PheIle: 2.145 ± 0.726
PheLys: 1.907 ± 0.839
PheLeu: 2.383 ± 0.667
PheMet: 0.953 ± 0.383
PheAsn: 2.145 ± 0.688
PhePro: 3.575 ± 0.834
PheGln: 4.051 ± 0.28
PheArg: 2.383 ± 0.615
PheSer: 4.051 ± 0.71
PheThr: 2.383 ± 0.635
PheVal: 3.575 ± 1.302
PheTrp: 0.238 ± 0.115
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.098 ± 1.509
GlyCys: 1.192 ± 0.576
GlyAsp: 3.575 ± 1.209
GlyGlu: 3.575 ± 0.616
GlyPhe: 2.86 ± 1.374
GlyGly: 4.528 ± 1.738
GlyHis: 0.953 ± 0.461
GlyIle: 2.86 ± 0.694
GlyLys: 2.86 ± 1.383
GlyLeu: 5.72 ± 1.039
GlyMet: 2.622 ± 0.206
GlyAsn: 1.907 ± 0.63
GlyPro: 2.145 ± 0.887
GlyGln: 2.383 ± 0.156
GlyArg: 1.668 ± 0.999
GlySer: 4.766 ± 3.026
GlyThr: 3.337 ± 1.128
GlyVal: 3.575 ± 0.645
GlyTrp: 0.953 ± 0.461
GlyTyr: 2.383 ± 0.797
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.383 ± 0.156
HisCys: 0.477 ± 0.504
HisAsp: 0.953 ± 0.461
HisGlu: 1.668 ± 0.564
HisPhe: 0.953 ± 1.009
HisGly: 1.907 ± 0.26
HisHis: 0.238 ± 0.115
HisIle: 0.715 ± 0.346
HisLys: 1.43 ± 0.515
HisLeu: 2.383 ± 1.996
HisMet: 0.238 ± 0.115
HisAsn: 1.192 ± 1.179
HisPro: 1.43 ± 0.341
HisGln: 0.715 ± 0.346
HisArg: 1.43 ± 0.849
HisSer: 1.192 ± 0.576
HisThr: 1.43 ± 0.341
HisVal: 1.668 ± 0.952
HisTrp: 1.43 ± 0.341
HisTyr: 0.477 ± 0.23
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.243 ± 0.974
IleCys: 1.192 ± 0.333
IleAsp: 4.051 ± 0.991
IleGlu: 3.337 ± 1.018
IlePhe: 1.668 ± 0.806
IleGly: 2.622 ± 0.998
IleHis: 1.907 ± 0.922
IleIle: 3.098 ± 0.398
IleLys: 2.86 ± 1.383
IleLeu: 4.29 ± 1.453
IleMet: 0.715 ± 0.346
IleAsn: 3.337 ± 1.838
IlePro: 3.098 ± 0.398
IleGln: 2.622 ± 1.772
IleArg: 1.907 ± 0.63
IleSer: 5.72 ± 0.109
IleThr: 3.098 ± 0.398
IleVal: 5.005 ± 1.961
IleTrp: 0.477 ± 0.504
IleTyr: 1.907 ± 0.922
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.43 ± 0.691
LysCys: 0.715 ± 0.346
LysAsp: 3.813 ± 0.729
LysGlu: 4.051 ± 0.71
LysPhe: 3.337 ± 1.056
LysGly: 2.86 ± 0.99
LysHis: 0.477 ± 0.23
LysIle: 4.766 ± 0.511
LysLys: 2.86 ± 0.84
LysLeu: 5.481 ± 2.066
LysMet: 1.192 ± 0.925
LysAsn: 2.145 ± 1.037
LysPro: 1.907 ± 0.922
LysGln: 1.907 ± 0.26
LysArg: 2.383 ± 0.635
LysSer: 5.958 ± 1.966
LysThr: 3.575 ± 1.728
LysVal: 4.29 ± 1.023
LysTrp: 0.477 ± 0.504
LysTyr: 2.145 ± 0.54
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.766 ± 1.727
LeuCys: 2.622 ± 0.664
LeuAsp: 6.196 ± 1.53
LeuGlu: 5.243 ± 3.544
LeuPhe: 3.575 ± 0.805
LeuGly: 3.813 ± 0.729
LeuHis: 1.192 ± 0.576
LeuIle: 3.813 ± 1.4
LeuLys: 5.72 ± 2.18
LeuLeu: 7.626 ± 2.114
LeuMet: 3.575 ± 0.834
LeuAsn: 4.29 ± 0.58
LeuPro: 7.15 ± 0.467
LeuGln: 3.813 ± 2.09
LeuArg: 5.243 ± 1.703
LeuSer: 5.481 ± 2.987
LeuThr: 6.673 ± 1.011
LeuVal: 5.243 ± 0.54
LeuTrp: 0.953 ± 0.461
LeuTyr: 1.668 ± 0.385
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.145 ± 0.971
MetCys: 0.953 ± 0.364
MetAsp: 2.383 ± 0.635
MetGlu: 1.192 ± 0.925
MetPhe: 1.43 ± 0.691
MetGly: 1.668 ± 1.769
MetHis: 0.477 ± 0.23
MetIle: 0.715 ± 0.346
MetLys: 1.907 ± 0.922
MetLeu: 2.383 ± 1.205
MetMet: 0.715 ± 0.346
MetAsn: 0.715 ± 0.346
MetPro: 1.43 ± 0.849
MetGln: 0.715 ± 0.346
MetArg: 0.953 ± 0.461
MetSer: 0.953 ± 1.009
MetThr: 0.477 ± 0.23
MetVal: 1.43 ± 0.341
MetTrp: 0.0 ± 0.0
MetTyr: 0.953 ± 0.461
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.098 ± 1.074
AsnCys: 0.477 ± 0.23
AsnAsp: 2.86 ± 1.383
AsnGlu: 2.383 ± 0.635
AsnPhe: 1.192 ± 0.576
AsnGly: 3.813 ± 1.41
AsnHis: 1.43 ± 0.341
AsnIle: 2.86 ± 0.4
AsnLys: 2.622 ± 0.735
AsnLeu: 3.575 ± 1.144
AsnMet: 1.668 ± 0.359
AsnAsn: 1.192 ± 0.489
AsnPro: 1.907 ± 0.728
AsnGln: 1.43 ± 0.849
AsnArg: 2.86 ± 0.4
AsnSer: 4.766 ± 1.657
AsnThr: 2.145 ± 0.709
AsnVal: 4.29 ± 1.064
AsnTrp: 0.477 ± 0.23
AsnTyr: 1.907 ± 0.922
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.192 ± 0.333
ProCys: 0.477 ± 0.23
ProAsp: 3.813 ± 0.729
ProGlu: 3.337 ± 0.219
ProPhe: 1.907 ± 0.728
ProGly: 3.098 ± 0.824
ProHis: 1.668 ± 1.059
ProIle: 3.098 ± 1.498
ProLys: 3.098 ± 0.301
ProLeu: 2.383 ± 0.156
ProMet: 0.715 ± 0.346
ProAsn: 3.098 ± 0.658
ProPro: 4.766 ± 2.303
ProGln: 1.907 ± 0.728
ProArg: 1.668 ± 0.782
ProSer: 5.958 ± 1.347
ProThr: 5.005 ± 1.298
ProVal: 4.766 ± 0.59
ProTrp: 0.477 ± 0.23
ProTyr: 0.953 ± 0.461
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.86 ± 0.4
GlnCys: 1.192 ± 0.576
GlnAsp: 2.383 ± 1.205
GlnGlu: 2.86 ± 2.485
GlnPhe: 1.668 ± 0.359
GlnGly: 1.43 ± 0.341
GlnHis: 0.953 ± 0.687
GlnIle: 2.86 ± 1.698
GlnLys: 2.145 ± 0.54
GlnLeu: 3.337 ± 1.018
GlnMet: 0.715 ± 0.516
GlnAsn: 1.668 ± 0.385
GlnPro: 2.145 ± 0.181
GlnGln: 1.192 ± 0.576
GlnArg: 2.383 ± 1.151
GlnSer: 3.575 ± 2.186
GlnThr: 1.43 ± 0.691
GlnVal: 2.145 ± 0.688
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.192 ± 0.333
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.005 ± 0.873
ArgCys: 0.953 ± 1.009
ArgAsp: 2.622 ± 0.664
ArgGlu: 2.145 ± 0.726
ArgPhe: 1.907 ± 0.839
ArgGly: 2.86 ± 0.99
ArgHis: 0.953 ± 1.009
ArgIle: 3.575 ± 0.616
ArgLys: 2.86 ± 0.931
ArgLeu: 2.86 ± 0.84
ArgMet: 1.907 ± 1.548
ArgAsn: 3.575 ± 0.616
ArgPro: 2.86 ± 1.608
ArgGln: 2.383 ± 1.151
ArgArg: 3.098 ± 0.824
ArgSer: 5.481 ± 3.741
ArgThr: 2.145 ± 1.598
ArgVal: 3.098 ± 0.301
ArgTrp: 0.238 ± 0.115
ArgTyr: 1.43 ± 0.341
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.051 ± 2.442
SerCys: 2.86 ± 1.383
SerAsp: 4.766 ± 1.27
SerGlu: 4.766 ± 1.867
SerPhe: 2.383 ± 0.615
SerGly: 4.766 ± 4.306
SerHis: 1.43 ± 0.341
SerIle: 5.243 ± 1.619
SerLys: 6.196 ± 0.125
SerLeu: 9.295 ± 2.867
SerMet: 0.715 ± 0.424
SerAsn: 3.098 ± 1.509
SerPro: 3.098 ± 1.092
SerGln: 3.337 ± 1.018
SerArg: 5.958 ± 6.253
SerSer: 8.58 ± 4.198
SerThr: 6.196 ± 1.396
SerVal: 5.958 ± 2.123
SerTrp: 1.192 ± 1.179
SerTyr: 2.622 ± 1.041
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.86 ± 1.383
ThrCys: 0.953 ± 0.461
ThrAsp: 3.337 ± 1.128
ThrGlu: 1.43 ± 0.341
ThrPhe: 3.337 ± 0.506
ThrGly: 4.528 ± 1.946
ThrHis: 1.907 ± 0.97
ThrIle: 3.813 ± 0.729
ThrLys: 2.145 ± 0.54
ThrLeu: 6.196 ± 1.306
ThrMet: 1.668 ± 0.806
ThrAsn: 2.86 ± 1.383
ThrPro: 3.575 ± 1.302
ThrGln: 0.953 ± 0.461
ThrArg: 2.383 ± 0.156
ThrSer: 6.435 ± 2.66
ThrThr: 3.813 ± 1.41
ThrVal: 4.051 ± 1.388
ThrTrp: 1.907 ± 0.26
ThrTyr: 1.907 ± 0.922
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.005 ± 0.701
ValCys: 0.715 ± 0.346
ValAsp: 4.051 ± 0.991
ValGlu: 3.813 ± 0.719
ValPhe: 2.383 ± 1.205
ValGly: 3.337 ± 1.196
ValHis: 2.622 ± 1.144
ValIle: 3.813 ± 1.276
ValLys: 3.813 ± 1.843
ValLeu: 6.673 ± 0.492
ValMet: 1.43 ± 0.341
ValAsn: 3.813 ± 1.276
ValPro: 3.813 ± 0.729
ValGln: 2.145 ± 0.726
ValArg: 3.098 ± 0.994
ValSer: 6.196 ± 2.685
ValThr: 3.098 ± 0.658
ValVal: 5.481 ± 0.926
ValTrp: 0.715 ± 0.346
ValTyr: 2.383 ± 1.152
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.43 ± 1.6
TrpCys: 0.0 ± 0.0
TrpAsp: 0.238 ± 0.115
TrpGlu: 1.192 ± 0.333
TrpPhe: 0.715 ± 0.424
TrpGly: 1.192 ± 0.333
TrpHis: 0.0 ± 0.0
TrpIle: 0.715 ± 0.346
TrpLys: 0.238 ± 0.115
TrpLeu: 0.953 ± 0.461
TrpMet: 0.0 ± 0.0
TrpAsn: 0.238 ± 0.115
TrpPro: 0.477 ± 0.23
TrpGln: 0.238 ± 0.115
TrpArg: 1.668 ± 0.782
TrpSer: 0.715 ± 0.346
TrpThr: 1.668 ± 0.806
TrpVal: 0.238 ± 0.115
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.953 ± 0.364
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.43 ± 0.515
TyrCys: 1.43 ± 0.341
TyrAsp: 2.145 ± 1.037
TyrGlu: 1.192 ± 0.576
TyrPhe: 1.668 ± 0.385
TyrGly: 3.098 ± 1.498
TyrHis: 1.907 ± 1.348
TyrIle: 0.715 ± 0.346
TyrLys: 1.668 ± 0.806
TyrLeu: 2.622 ± 0.735
TyrMet: 0.238 ± 0.115
TyrAsn: 2.383 ± 1.152
TyrPro: 1.907 ± 0.63
TyrGln: 1.192 ± 0.576
TyrArg: 1.668 ± 0.385
TyrSer: 2.145 ± 1.037
TyrThr: 1.192 ± 0.576
TyrVal: 2.383 ± 0.635
TyrTrp: 0.238 ± 0.115
TyrTyr: 1.43 ± 0.849
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4197 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
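One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resample, and report the spread of the resulting estimates. The sketch below follows that reading; the function names, the use of the sample standard deviation as the error, the fixed seed, and the toy sequences are all assumptions and may differ from how the reported values were obtained.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide across the given sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation over
    resamples; each resample draws len(sequences) whole proteins
    with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome of three hypothetical proteins (not the real sequences).
toy = ["MAALAK", "AAKLL", "MKAAG"]
err = bootstrap_error(toy, "AA")
```

With only three proteins, as in this proteome, each resample contains few distinct proteins, which is consistent with some of the reported errors being large relative to the values themselves.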

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski