Amino acid dipepetide frequency for Hubei picorna-like virus 16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.495AlaAla: 3.495 ± 0.727
0.777AlaCys: 0.777 ± 0.234
3.107AlaAsp: 3.107 ± 1.585
1.942AlaGlu: 1.942 ± 0.259
3.883AlaPhe: 3.883 ± 1.819
1.553AlaGly: 1.553 ± 0.467
1.553AlaHis: 1.553 ± 0.832
3.883AlaIle: 3.883 ± 0.781
3.883AlaLys: 3.883 ± 0.519
4.272AlaLeu: 4.272 ± 1.61
0.777AlaMet: 0.777 ± 0.189
3.883AlaAsn: 3.883 ± 3.118
1.165AlaPro: 1.165 ± 0.676
3.495AlaGln: 3.495 ± 2.027
2.33AlaArg: 2.33 ± 0.599
3.107AlaSer: 3.107 ± 0.935
3.107AlaThr: 3.107 ± 0.285
1.942AlaVal: 1.942 ± 0.391
0.0AlaTrp: 0.0 ± 0.0
2.33AlaTyr: 2.33 ± 0.701
0.0AlaXaa: 0.0 ± 0.0
Cys
1.942CysAla: 1.942 ± 0.391
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.553CysGlu: 1.553 ± 0.467
1.553CysPhe: 1.553 ± 0.467
1.553CysGly: 1.553 ± 0.467
0.0CysHis: 0.0 ± 0.0
1.165CysIle: 1.165 ± 0.026
0.777CysLys: 0.777 ± 0.416
1.942CysLeu: 1.942 ± 1.041
0.0CysMet: 0.0 ± 0.0
2.33CysAsn: 2.33 ± 1.249
1.165CysPro: 1.165 ± 0.676
0.0CysGln: 0.0 ± 0.0
0.777CysArg: 0.777 ± 0.416
0.388CysSer: 0.388 ± 0.208
0.777CysThr: 0.777 ± 0.234
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.33AspAla: 2.33 ± 0.051
1.942AspCys: 1.942 ± 0.259
5.825AspAsp: 5.825 ± 1.172
4.66AspGlu: 4.66 ± 1.847
3.107AspPhe: 3.107 ± 1.015
3.495AspGly: 3.495 ± 1.223
0.388AspHis: 0.388 ± 0.208
8.932AspIle: 8.932 ± 0.237
3.495AspLys: 3.495 ± 1.223
4.66AspLeu: 4.66 ± 0.547
1.553AspMet: 1.553 ± 0.182
2.718AspAsn: 2.718 ± 0.157
1.165AspPro: 1.165 ± 0.026
1.165AspGln: 1.165 ± 0.676
1.553AspArg: 1.553 ± 0.182
1.942AspSer: 1.942 ± 0.259
3.495AspThr: 3.495 ± 0.077
1.553AspVal: 1.553 ± 0.182
1.165AspTrp: 1.165 ± 0.676
1.165AspTyr: 1.165 ± 0.026
0.0AspXaa: 0.0 ± 0.0
Glu
2.33GluAla: 2.33 ± 0.701
0.388GluCys: 0.388 ± 0.208
3.495GluAsp: 3.495 ± 1.223
3.883GluGlu: 3.883 ± 1.431
2.33GluPhe: 2.33 ± 0.599
1.942GluGly: 1.942 ± 0.259
1.165GluHis: 1.165 ± 0.026
6.99GluIle: 6.99 ± 2.446
5.437GluLys: 5.437 ± 0.964
3.495GluLeu: 3.495 ± 1.223
3.107GluMet: 3.107 ± 1.015
3.495GluAsn: 3.495 ± 1.873
0.388GluPro: 0.388 ± 0.208
2.718GluGln: 2.718 ± 0.807
0.388GluArg: 0.388 ± 0.208
4.272GluSer: 4.272 ± 0.311
3.107GluThr: 3.107 ± 0.365
3.107GluVal: 3.107 ± 1.585
0.388GluTrp: 0.388 ± 0.442
1.165GluTyr: 1.165 ± 0.624
0.0GluXaa: 0.0 ± 0.0
Phe
3.107PheAla: 3.107 ± 1.585
0.0PheCys: 0.0 ± 0.0
3.107PheAsp: 3.107 ± 0.285
2.718PheGlu: 2.718 ± 0.807
2.33PhePhe: 2.33 ± 0.599
3.107PheGly: 3.107 ± 0.285
0.777PheHis: 0.777 ± 0.884
2.718PheIle: 2.718 ± 0.157
6.214PheLys: 6.214 ± 2.68
1.942PheLeu: 1.942 ± 1.041
1.553PheMet: 1.553 ± 0.832
3.495PheAsn: 3.495 ± 0.573
1.942PhePro: 1.942 ± 0.259
1.553PheGln: 1.553 ± 0.467
1.165PheArg: 1.165 ± 0.676
3.883PheSer: 3.883 ± 1.431
1.942PheThr: 1.942 ± 0.391
2.718PheVal: 2.718 ± 1.793
0.777PheTrp: 0.777 ± 0.416
1.165PheTyr: 1.165 ± 0.624
0.0PheXaa: 0.0 ± 0.0
Gly
2.718GlyAla: 2.718 ± 0.493
0.777GlyCys: 0.777 ± 0.416
4.272GlyAsp: 4.272 ± 0.311
2.33GlyGlu: 2.33 ± 0.701
3.495GlyPhe: 3.495 ± 0.727
3.495GlyGly: 3.495 ± 0.077
1.553GlyHis: 1.553 ± 0.832
3.495GlyIle: 3.495 ± 0.573
4.66GlyLys: 4.66 ± 1.847
5.049GlyLeu: 5.049 ± 2.494
1.165GlyMet: 1.165 ± 0.472
3.495GlyAsn: 3.495 ± 1.223
1.165GlyPro: 1.165 ± 0.026
2.33GlyGln: 2.33 ± 0.051
1.942GlyArg: 1.942 ± 0.909
3.107GlySer: 3.107 ± 0.935
5.049GlyThr: 5.049 ± 3.144
3.107GlyVal: 3.107 ± 1.585
0.0GlyTrp: 0.0 ± 0.0
3.107GlyTyr: 3.107 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
1.553HisAla: 1.553 ± 0.182
0.777HisCys: 0.777 ± 0.234
0.388HisAsp: 0.388 ± 0.208
0.388HisGlu: 0.388 ± 0.208
1.553HisPhe: 1.553 ± 0.832
3.107HisGly: 3.107 ± 0.365
1.165HisHis: 1.165 ± 0.624
1.165HisIle: 1.165 ± 0.624
0.777HisLys: 0.777 ± 0.234
2.718HisLeu: 2.718 ± 0.807
1.165HisMet: 1.165 ± 0.026
1.553HisAsn: 1.553 ± 0.832
0.0HisPro: 0.0 ± 0.0
1.165HisGln: 1.165 ± 0.026
1.553HisArg: 1.553 ± 0.182
1.553HisSer: 1.553 ± 0.832
1.165HisThr: 1.165 ± 0.026
1.165HisVal: 1.165 ± 0.026
0.0HisTrp: 0.0 ± 0.0
1.553HisTyr: 1.553 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
3.883IleAla: 3.883 ± 0.131
1.553IleCys: 1.553 ± 0.182
5.437IleAsp: 5.437 ± 1.636
5.825IleGlu: 5.825 ± 1.822
3.107IlePhe: 3.107 ± 0.285
5.049IleGly: 5.049 ± 0.544
2.718IleHis: 2.718 ± 1.457
5.437IleIle: 5.437 ± 0.964
6.602IleLys: 6.602 ± 2.238
6.99IleLeu: 6.99 ± 1.146
2.718IleMet: 2.718 ± 0.807
5.825IleAsn: 5.825 ± 1.172
4.272IlePro: 4.272 ± 2.91
1.165IleGln: 1.165 ± 0.026
4.66IleArg: 4.66 ± 0.547
6.99IleSer: 6.99 ± 0.496
3.495IleThr: 3.495 ± 0.077
6.99IleVal: 6.99 ± 0.804
0.388IleTrp: 0.388 ± 0.442
4.66IleTyr: 4.66 ± 0.547
0.0IleXaa: 0.0 ± 0.0
Lys
1.165LysAla: 1.165 ± 0.026
1.553LysCys: 1.553 ± 0.832
3.107LysAsp: 3.107 ± 1.015
3.107LysGlu: 3.107 ± 1.665
3.495LysPhe: 3.495 ± 0.573
3.883LysGly: 3.883 ± 2.081
1.942LysHis: 1.942 ± 1.041
8.544LysIle: 8.544 ± 0.029
6.602LysLys: 6.602 ± 3.538
8.932LysLeu: 8.932 ± 3.487
2.33LysMet: 2.33 ± 0.701
6.214LysAsn: 6.214 ± 2.03
3.107LysPro: 3.107 ± 0.935
3.107LysGln: 3.107 ± 1.015
2.33LysArg: 2.33 ± 0.599
2.718LysSer: 2.718 ± 0.157
4.272LysThr: 4.272 ± 0.339
3.107LysVal: 3.107 ± 1.015
0.777LysTrp: 0.777 ± 0.416
6.214LysTyr: 6.214 ± 2.68
0.0LysXaa: 0.0 ± 0.0
Leu
2.33LeuAla: 2.33 ± 1.351
1.553LeuCys: 1.553 ± 0.832
1.165LeuAsp: 1.165 ± 0.624
4.272LeuGlu: 4.272 ± 2.289
3.107LeuPhe: 3.107 ± 0.365
2.718LeuGly: 2.718 ± 0.807
3.883LeuHis: 3.883 ± 0.781
6.602LeuIle: 6.602 ± 0.938
6.214LeuLys: 6.214 ± 2.68
7.767LeuLeu: 7.767 ± 2.862
1.553LeuMet: 1.553 ± 0.832
6.602LeuAsn: 6.602 ± 1.588
3.883LeuPro: 3.883 ± 0.519
3.495LeuGln: 3.495 ± 0.077
3.495LeuArg: 3.495 ± 1.377
8.932LeuSer: 8.932 ± 2.363
3.883LeuThr: 3.883 ± 0.131
5.049LeuVal: 5.049 ± 0.106
2.33LeuTrp: 2.33 ± 0.051
3.495LeuTyr: 3.495 ± 1.223
0.0LeuXaa: 0.0 ± 0.0
Met
2.718MetAla: 2.718 ± 0.807
0.388MetCys: 0.388 ± 0.208
0.0MetAsp: 0.0 ± 0.0
2.33MetGlu: 2.33 ± 0.599
2.33MetPhe: 2.33 ± 0.051
1.165MetGly: 1.165 ± 0.676
1.553MetHis: 1.553 ± 0.832
1.942MetIle: 1.942 ± 2.209
3.107MetLys: 3.107 ± 1.015
1.942MetLeu: 1.942 ± 1.041
0.777MetMet: 0.777 ± 0.416
1.553MetAsn: 1.553 ± 0.182
0.0MetPro: 0.0 ± 0.0
0.777MetGln: 0.777 ± 0.234
0.388MetArg: 0.388 ± 0.208
2.33MetSer: 2.33 ± 0.051
1.942MetThr: 1.942 ± 1.041
1.165MetVal: 1.165 ± 0.026
0.388MetTrp: 0.388 ± 0.208
0.777MetTyr: 0.777 ± 0.416
0.0MetXaa: 0.0 ± 0.0
Asn
4.66AsnAla: 4.66 ± 2.052
1.165AsnCys: 1.165 ± 0.676
6.602AsnAsp: 6.602 ± 1.588
5.437AsnGlu: 5.437 ± 0.336
4.66AsnPhe: 4.66 ± 1.197
4.272AsnGly: 4.272 ± 0.311
0.388AsnHis: 0.388 ± 0.208
6.602AsnIle: 6.602 ± 1.588
5.049AsnLys: 5.049 ± 1.406
5.825AsnLeu: 5.825 ± 0.778
1.553AsnMet: 1.553 ± 0.832
5.049AsnAsn: 5.049 ± 0.756
2.33AsnPro: 2.33 ± 0.701
2.33AsnGln: 2.33 ± 1.351
1.942AsnArg: 1.942 ± 1.041
6.214AsnSer: 6.214 ± 1.38
3.495AsnThr: 3.495 ± 0.077
2.718AsnVal: 2.718 ± 0.493
1.165AsnTrp: 1.165 ± 0.026
3.107AsnTyr: 3.107 ± 0.285
0.0AsnXaa: 0.0 ± 0.0
Pro
2.33ProAla: 2.33 ± 0.701
0.388ProCys: 0.388 ± 0.442
0.777ProAsp: 0.777 ± 0.234
1.942ProGlu: 1.942 ± 0.259
1.942ProPhe: 1.942 ± 0.259
3.107ProGly: 3.107 ± 0.285
0.388ProHis: 0.388 ± 0.442
1.553ProIle: 1.553 ± 0.182
1.942ProLys: 1.942 ± 0.391
3.495ProLeu: 3.495 ± 0.077
0.777ProMet: 0.777 ± 0.234
2.33ProAsn: 2.33 ± 1.351
1.165ProPro: 1.165 ± 1.326
1.553ProGln: 1.553 ± 0.467
0.777ProArg: 0.777 ± 0.884
2.718ProSer: 2.718 ± 0.807
2.718ProThr: 2.718 ± 1.793
3.883ProVal: 3.883 ± 3.118
0.777ProTrp: 0.777 ± 0.234
2.718ProTyr: 2.718 ± 2.443
0.0ProXaa: 0.0 ± 0.0
Gln
2.33GlnAla: 2.33 ± 0.701
0.777GlnCys: 0.777 ± 0.416
2.33GlnAsp: 2.33 ± 0.599
1.942GlnGlu: 1.942 ± 1.559
1.553GlnPhe: 1.553 ± 0.182
0.777GlnGly: 0.777 ± 0.884
0.388GlnHis: 0.388 ± 0.442
3.495GlnIle: 3.495 ± 0.727
2.718GlnLys: 2.718 ± 0.157
2.33GlnLeu: 2.33 ± 0.599
1.165GlnMet: 1.165 ± 0.676
1.553GlnAsn: 1.553 ± 0.467
1.553GlnPro: 1.553 ± 0.182
1.165GlnGln: 1.165 ± 0.026
2.33GlnArg: 2.33 ± 0.051
0.777GlnSer: 0.777 ± 0.416
1.553GlnThr: 1.553 ± 0.467
3.107GlnVal: 3.107 ± 1.585
0.777GlnTrp: 0.777 ± 0.416
2.33GlnTyr: 2.33 ± 0.051
0.0GlnXaa: 0.0 ± 0.0
Arg
1.942ArgAla: 1.942 ± 0.909
0.0ArgCys: 0.0 ± 0.0
1.942ArgAsp: 1.942 ± 0.391
0.388ArgGlu: 0.388 ± 0.208
0.388ArgPhe: 0.388 ± 0.442
2.718ArgGly: 2.718 ± 1.143
1.165ArgHis: 1.165 ± 0.624
3.883ArgIle: 3.883 ± 0.131
2.718ArgLys: 2.718 ± 1.457
3.495ArgLeu: 3.495 ± 0.727
1.165ArgMet: 1.165 ± 0.026
2.718ArgAsn: 2.718 ± 0.493
0.777ArgPro: 0.777 ± 0.416
2.33ArgGln: 2.33 ± 0.051
0.388ArgArg: 0.388 ± 0.442
1.942ArgSer: 1.942 ± 1.559
2.33ArgThr: 2.33 ± 0.701
1.942ArgVal: 1.942 ± 1.041
0.388ArgTrp: 0.388 ± 0.208
2.33ArgTyr: 2.33 ± 0.599
0.0ArgXaa: 0.0 ± 0.0
Ser
4.272SerAla: 4.272 ± 2.91
1.165SerCys: 1.165 ± 0.676
4.66SerAsp: 4.66 ± 1.847
1.165SerGlu: 1.165 ± 0.624
2.718SerPhe: 2.718 ± 1.457
3.883SerGly: 3.883 ± 1.169
0.777SerHis: 0.777 ± 0.884
8.155SerIle: 8.155 ± 0.471
4.272SerLys: 4.272 ± 0.989
7.379SerLeu: 7.379 ± 1.354
2.718SerMet: 2.718 ± 0.157
4.66SerAsn: 4.66 ± 1.402
4.66SerPro: 4.66 ± 1.402
2.718SerGln: 2.718 ± 0.493
1.553SerArg: 1.553 ± 0.182
5.437SerSer: 5.437 ± 0.336
3.883SerThr: 3.883 ± 3.118
1.165SerVal: 1.165 ± 0.026
0.0SerTrp: 0.0 ± 0.0
4.272SerTyr: 4.272 ± 0.311
0.0SerXaa: 0.0 ± 0.0
Thr
2.33ThrAla: 2.33 ± 1.351
0.777ThrCys: 0.777 ± 0.234
3.495ThrAsp: 3.495 ± 0.727
1.942ThrGlu: 1.942 ± 1.041
1.942ThrPhe: 1.942 ± 0.259
3.883ThrGly: 3.883 ± 2.469
1.942ThrHis: 1.942 ± 0.259
4.272ThrIle: 4.272 ± 0.311
3.495ThrLys: 3.495 ± 0.077
2.33ThrLeu: 2.33 ± 0.599
1.553ThrMet: 1.553 ± 0.182
4.66ThrAsn: 4.66 ± 0.752
4.272ThrPro: 4.272 ± 2.26
1.942ThrGln: 1.942 ± 0.391
2.33ThrArg: 2.33 ± 0.051
4.272ThrSer: 4.272 ± 1.61
3.495ThrThr: 3.495 ± 1.377
4.272ThrVal: 4.272 ± 2.26
0.388ThrTrp: 0.388 ± 0.442
2.33ThrTyr: 2.33 ± 0.051
0.0ThrXaa: 0.0 ± 0.0
Val
1.942ValAla: 1.942 ± 0.909
1.165ValCys: 1.165 ± 0.624
1.942ValAsp: 1.942 ± 0.391
3.883ValGlu: 3.883 ± 0.131
0.777ValPhe: 0.777 ± 0.416
4.66ValGly: 4.66 ± 2.702
0.777ValHis: 0.777 ± 0.416
5.437ValIle: 5.437 ± 0.314
3.883ValLys: 3.883 ± 1.169
3.883ValLeu: 3.883 ± 0.131
1.165ValMet: 1.165 ± 0.624
4.66ValAsn: 4.66 ± 0.752
1.942ValPro: 1.942 ± 2.209
0.777ValGln: 0.777 ± 0.234
1.942ValArg: 1.942 ± 0.259
5.437ValSer: 5.437 ± 2.936
1.942ValThr: 1.942 ± 2.209
1.942ValVal: 1.942 ± 1.559
0.0ValTrp: 0.0 ± 0.0
4.66ValTyr: 4.66 ± 0.752
0.0ValXaa: 0.0 ± 0.0
Trp
1.165TrpAla: 1.165 ± 0.026
0.0TrpCys: 0.0 ± 0.0
0.777TrpAsp: 0.777 ± 0.416
0.388TrpGlu: 0.388 ± 0.442
0.777TrpPhe: 0.777 ± 0.416
0.0TrpGly: 0.0 ± 0.0
0.388TrpHis: 0.388 ± 0.208
0.777TrpIle: 0.777 ± 0.234
0.388TrpLys: 0.388 ± 0.208
1.553TrpLeu: 1.553 ± 0.182
0.0TrpMet: 0.0 ± 0.0
0.388TrpAsn: 0.388 ± 0.208
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.777TrpArg: 0.777 ± 0.234
0.777TrpSer: 0.777 ± 0.234
0.777TrpThr: 0.777 ± 0.234
1.165TrpVal: 1.165 ± 0.026
0.0TrpTrp: 0.0 ± 0.0
0.777TrpTyr: 0.777 ± 0.234
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.33TyrAla: 2.33 ± 0.599
0.777TyrCys: 0.777 ± 0.416
3.883TyrAsp: 3.883 ± 1.431
3.495TyrGlu: 3.495 ± 1.223
1.165TyrPhe: 1.165 ± 0.026
1.942TyrGly: 1.942 ± 0.391
1.553TyrHis: 1.553 ± 0.182
2.718TyrIle: 2.718 ± 1.143
4.272TyrLys: 4.272 ± 1.639
2.33TyrLeu: 2.33 ± 0.051
0.388TyrMet: 0.388 ± 0.208
7.379TyrAsn: 7.379 ± 0.596
2.33TyrPro: 2.33 ± 0.051
1.165TyrGln: 1.165 ± 0.676
2.33TyrArg: 2.33 ± 0.701
2.718TyrSer: 2.718 ± 0.157
3.495TyrThr: 3.495 ± 0.727
2.718TyrVal: 2.718 ± 0.493
1.165TyrTrp: 1.165 ± 0.624
3.495TyrTyr: 3.495 ± 2.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2576 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski