Amino acid dipeptide frequency for Hubei picorna-like virus 21

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
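The counting described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_permille`, the use of one-letter codes, the overlapping two-residue window, and the mapping of every non-standard letter to `X` (standing in for Xaa) are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Dipeptides are counted with an overlapping window inside each protein
    (never across protein boundaries). Any non-standard letter is mapped
    to 'X', a hypothetical stand-in for Xaa.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Number of possible dipeptides without and with Xaa.
N_STANDARD_PAIRS = len(STANDARD) ** 2          # 400
N_PAIRS_WITH_XAA = (len(STANDARD) + 1) ** 2    # 441

if __name__ == "__main__":
    toy = ["AAC", "AC"]  # toy sequences, not taken from the proteome
    for pair, value in sorted(dipeptide_permille(toy).items()):
        print(pair, round(value, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 21×21 table like the one below would fill those cells with 0.0.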

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
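As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400 (2.5‰). The helper names are hypothetical, and classifying a value as over- or underrepresented by comparing it against 2.5‰ is an assumption about how the table colouring works, not a documented rule.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10.0


def representation(value, expected=UNIFORM_EXPECTED_PERMILLE):
    """Label a per mille value relative to the (assumed) uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    ala_ala = 9.484  # AlaAla value from the table, in per mille
    print(permille_to_fraction(ala_ala))
    print(permille_to_percent(ala_ala))
    print(representation(ala_ala))
```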

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.484AlaAla: 9.484 ± 0.385
1.517AlaCys: 1.517 ± 0.511
4.173AlaAsp: 4.173 ± 0.873
5.69AlaGlu: 5.69 ± 0.29
3.794AlaPhe: 3.794 ± 0.675
6.829AlaGly: 6.829 ± 0.347
1.517AlaHis: 1.517 ± 0.791
5.311AlaIle: 5.311 ± 0.487
3.414AlaLys: 3.414 ± 1.779
6.07AlaLeu: 6.07 ± 1.394
3.035AlaMet: 3.035 ± 1.022
1.517AlaAsn: 1.517 ± 1.162
5.311AlaPro: 5.311 ± 1.789
3.414AlaGln: 3.414 ± 0.477
5.69AlaArg: 5.69 ± 0.361
6.07AlaSer: 6.07 ± 0.743
3.414AlaThr: 3.414 ± 1.476
7.208AlaVal: 7.208 ± 1.152
0.379AlaTrp: 0.379 ± 0.198
3.414AlaTyr: 3.414 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
1.897CysAla: 1.897 ± 0.337
0.379CysCys: 0.379 ± 0.198
0.759CysAsp: 0.759 ± 0.395
0.759CysGlu: 0.759 ± 0.395
0.379CysPhe: 0.379 ± 0.198
1.138CysGly: 1.138 ± 0.058
0.379CysHis: 0.379 ± 0.198
0.759CysIle: 0.759 ± 0.395
1.517CysLys: 1.517 ± 0.791
1.897CysLeu: 1.897 ± 0.337
0.759CysMet: 0.759 ± 0.256
0.379CysAsn: 0.379 ± 0.198
0.759CysPro: 0.759 ± 0.395
0.379CysGln: 0.379 ± 0.453
0.379CysArg: 0.379 ± 0.198
1.138CysSer: 1.138 ± 0.058
0.759CysThr: 0.759 ± 0.395
0.759CysVal: 0.759 ± 0.395
0.0CysTrp: 0.0 ± 0.0
0.759CysTyr: 0.759 ± 0.256
0.0CysXaa: 0.0 ± 0.0
Asp
6.449AspAla: 6.449 ± 1.847
1.897AspCys: 1.897 ± 0.337
4.552AspAsp: 4.552 ± 1.721
4.932AspGlu: 4.932 ± 1.919
3.414AspPhe: 3.414 ± 0.174
3.414AspGly: 3.414 ± 0.477
0.0AspHis: 0.0 ± 0.0
0.759AspIle: 0.759 ± 0.395
1.897AspLys: 1.897 ± 0.314
6.829AspLeu: 6.829 ± 0.955
0.759AspMet: 0.759 ± 0.256
1.897AspAsn: 1.897 ± 0.337
1.517AspPro: 1.517 ± 0.511
1.897AspGln: 1.897 ± 0.314
1.517AspArg: 1.517 ± 0.791
4.552AspSer: 4.552 ± 0.419
2.656AspThr: 2.656 ± 1.22
6.449AspVal: 6.449 ± 1.847
1.517AspTrp: 1.517 ± 0.791
1.897AspTyr: 1.897 ± 0.337
0.0AspXaa: 0.0 ± 0.0
Glu
3.414GluAla: 3.414 ± 1.779
0.379GluCys: 0.379 ± 0.198
3.414GluAsp: 3.414 ± 0.174
3.414GluGlu: 3.414 ± 1.779
5.311GluPhe: 5.311 ± 2.117
2.656GluGly: 2.656 ± 1.384
1.517GluHis: 1.517 ± 0.14
2.656GluIle: 2.656 ± 0.733
1.897GluLys: 1.897 ± 0.988
5.69GluLeu: 5.69 ± 1.012
1.517GluMet: 1.517 ± 0.511
2.656GluAsn: 2.656 ± 0.082
1.517GluPro: 1.517 ± 0.791
2.276GluGln: 2.276 ± 1.186
1.517GluArg: 1.517 ± 0.791
2.656GluSer: 2.656 ± 0.733
1.138GluThr: 1.138 ± 0.709
5.311GluVal: 5.311 ± 0.815
1.138GluTrp: 1.138 ± 0.593
2.276GluTyr: 2.276 ± 0.535
0.0GluXaa: 0.0 ± 0.0
Phe
2.276PheAla: 2.276 ± 0.116
0.379PheCys: 0.379 ± 0.198
3.794PheAsp: 3.794 ± 1.278
3.035PheGlu: 3.035 ± 0.931
2.276PhePhe: 2.276 ± 0.535
4.173PheGly: 4.173 ± 1.08
0.379PheHis: 0.379 ± 0.198
1.897PheIle: 1.897 ± 0.988
5.311PheLys: 5.311 ± 0.815
2.656PheLeu: 2.656 ± 0.082
1.517PheMet: 1.517 ± 0.791
2.276PheAsn: 2.276 ± 0.116
3.035PhePro: 3.035 ± 0.371
4.173PheGln: 4.173 ± 0.429
3.794PheArg: 3.794 ± 0.024
3.794PheSer: 3.794 ± 0.024
3.794PheThr: 3.794 ± 0.627
3.035PheVal: 3.035 ± 1.022
0.759PheTrp: 0.759 ± 0.395
1.897PheTyr: 1.897 ± 0.965
0.0PheXaa: 0.0 ± 0.0
Gly
3.035GlyAla: 3.035 ± 0.371
1.138GlyCys: 1.138 ± 0.593
6.07GlyAsp: 6.07 ± 1.21
2.656GlyGlu: 2.656 ± 0.733
1.897GlyPhe: 1.897 ± 0.988
2.656GlyGly: 2.656 ± 0.569
0.759GlyHis: 0.759 ± 0.256
2.656GlyIle: 2.656 ± 0.082
4.932GlyLys: 4.932 ± 0.685
1.517GlyLeu: 1.517 ± 0.14
1.138GlyMet: 1.138 ± 0.593
3.794GlyAsn: 3.794 ± 1.929
3.035GlyPro: 3.035 ± 1.022
1.138GlyGln: 1.138 ± 0.593
4.173GlyArg: 4.173 ± 0.222
4.932GlySer: 4.932 ± 1.987
3.794GlyThr: 3.794 ± 0.627
4.932GlyVal: 4.932 ± 0.685
0.759GlyTrp: 0.759 ± 0.256
4.173GlyTyr: 4.173 ± 1.524
0.0GlyXaa: 0.0 ± 0.0
His
1.897HisAla: 1.897 ± 0.337
0.379HisCys: 0.379 ± 0.198
1.138HisAsp: 1.138 ± 0.593
0.0HisGlu: 0.0 ± 0.0
1.517HisPhe: 1.517 ± 0.791
0.379HisGly: 0.379 ± 0.198
0.379HisHis: 0.379 ± 0.198
1.897HisIle: 1.897 ± 0.314
0.0HisLys: 0.0 ± 0.0
1.517HisLeu: 1.517 ± 0.14
0.0HisMet: 0.0 ± 0.0
0.759HisAsn: 0.759 ± 0.395
1.138HisPro: 1.138 ± 0.058
0.379HisGln: 0.379 ± 0.198
1.138HisArg: 1.138 ± 0.058
1.517HisSer: 1.517 ± 0.14
0.379HisThr: 0.379 ± 0.198
3.035HisVal: 3.035 ± 0.28
0.379HisTrp: 0.379 ± 0.198
0.759HisTyr: 0.759 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
6.07IleAla: 6.07 ± 0.559
1.138IleCys: 1.138 ± 0.593
2.276IleAsp: 2.276 ± 0.116
2.276IleGlu: 2.276 ± 1.186
1.897IlePhe: 1.897 ± 0.988
3.035IleGly: 3.035 ± 1.582
0.379IleHis: 0.379 ± 0.198
1.897IleIle: 1.897 ± 0.337
1.897IleLys: 1.897 ± 0.988
2.656IleLeu: 2.656 ± 0.082
0.759IleMet: 0.759 ± 0.395
3.414IleAsn: 3.414 ± 1.476
2.276IlePro: 2.276 ± 1.418
1.517IleGln: 1.517 ± 1.162
1.897IleArg: 1.897 ± 0.314
4.932IleSer: 4.932 ± 0.034
2.276IleThr: 2.276 ± 1.186
2.656IleVal: 2.656 ± 0.733
1.138IleTrp: 1.138 ± 0.593
2.276IleTyr: 2.276 ± 0.535
0.0IleXaa: 0.0 ± 0.0
Lys
4.932LysAla: 4.932 ± 1.919
0.0LysCys: 0.0 ± 0.0
4.173LysAsp: 4.173 ± 0.873
3.414LysGlu: 3.414 ± 1.128
1.138LysPhe: 1.138 ± 0.058
2.656LysGly: 2.656 ± 0.733
1.897LysHis: 1.897 ± 0.988
2.656LysIle: 2.656 ± 1.384
6.449LysLys: 6.449 ± 3.361
3.794LysLeu: 3.794 ± 0.675
3.414LysMet: 3.414 ± 0.174
2.656LysAsn: 2.656 ± 0.733
2.656LysPro: 2.656 ± 1.22
1.517LysGln: 1.517 ± 0.14
3.794LysArg: 3.794 ± 1.977
3.035LysSer: 3.035 ± 1.582
3.035LysThr: 3.035 ± 0.931
3.414LysVal: 3.414 ± 0.825
1.517LysTrp: 1.517 ± 0.511
3.794LysTyr: 3.794 ± 0.627
0.0LysXaa: 0.0 ± 0.0
Leu
6.449LeuAla: 6.449 ± 0.106
0.759LeuCys: 0.759 ± 0.395
4.173LeuAsp: 4.173 ± 0.429
5.69LeuGlu: 5.69 ± 0.29
0.379LeuPhe: 0.379 ± 0.453
5.311LeuGly: 5.311 ± 0.815
1.138LeuHis: 1.138 ± 0.058
2.276LeuIle: 2.276 ± 1.186
4.173LeuLys: 4.173 ± 0.429
5.311LeuLeu: 5.311 ± 0.164
1.517LeuMet: 1.517 ± 0.791
3.794LeuAsn: 3.794 ± 0.675
4.932LeuPro: 4.932 ± 1.987
2.276LeuGln: 2.276 ± 0.535
4.173LeuArg: 4.173 ± 1.524
7.587LeuSer: 7.587 ± 1.905
4.932LeuThr: 4.932 ± 1.336
6.07LeuVal: 6.07 ± 1.861
1.517LeuTrp: 1.517 ± 0.791
3.414LeuTyr: 3.414 ± 0.825
0.0LeuXaa: 0.0 ± 0.0
Met
3.035MetAla: 3.035 ± 0.28
0.0MetCys: 0.0 ± 0.0
0.759MetAsp: 0.759 ± 0.256
1.138MetGlu: 1.138 ± 0.593
1.897MetPhe: 1.897 ± 0.314
1.517MetGly: 1.517 ± 1.162
1.517MetHis: 1.517 ± 0.511
1.897MetIle: 1.897 ± 0.988
0.759MetLys: 0.759 ± 0.256
1.138MetLeu: 1.138 ± 0.058
0.379MetMet: 0.379 ± 0.198
0.759MetAsn: 0.759 ± 0.395
1.517MetPro: 1.517 ± 0.14
1.138MetGln: 1.138 ± 0.593
3.414MetArg: 3.414 ± 2.127
1.517MetSer: 1.517 ± 0.14
1.138MetThr: 1.138 ± 0.593
2.276MetVal: 2.276 ± 1.418
0.379MetTrp: 0.379 ± 0.453
1.138MetTyr: 1.138 ± 0.058
0.0MetXaa: 0.0 ± 0.0
Asn
7.587AsnAla: 7.587 ± 0.603
0.379AsnCys: 0.379 ± 0.198
2.656AsnAsp: 2.656 ± 0.082
1.138AsnGlu: 1.138 ± 0.593
3.414AsnPhe: 3.414 ± 0.174
2.276AsnGly: 2.276 ± 1.186
0.0AsnHis: 0.0 ± 0.0
1.517AsnIle: 1.517 ± 0.511
3.794AsnLys: 3.794 ± 1.326
3.035AsnLeu: 3.035 ± 1.022
1.517AsnMet: 1.517 ± 1.162
2.276AsnAsn: 2.276 ± 2.72
2.276AsnPro: 2.276 ± 0.116
0.759AsnGln: 0.759 ± 0.256
1.897AsnArg: 1.897 ± 0.314
3.414AsnSer: 3.414 ± 0.825
3.794AsnThr: 3.794 ± 0.627
2.656AsnVal: 2.656 ± 0.569
1.138AsnTrp: 1.138 ± 0.058
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.414ProAla: 3.414 ± 1.476
0.759ProCys: 0.759 ± 0.395
1.897ProAsp: 1.897 ± 2.267
2.276ProGlu: 2.276 ± 1.186
3.414ProPhe: 3.414 ± 2.127
3.414ProGly: 3.414 ± 0.477
1.138ProHis: 1.138 ± 0.709
4.932ProIle: 4.932 ± 0.617
3.414ProLys: 3.414 ± 1.128
2.656ProLeu: 2.656 ± 0.082
1.897ProMet: 1.897 ± 0.965
2.276ProAsn: 2.276 ± 0.535
2.656ProPro: 2.656 ± 2.522
1.517ProGln: 1.517 ± 0.511
1.897ProArg: 1.897 ± 0.337
4.173ProSer: 4.173 ± 3.033
3.414ProThr: 3.414 ± 2.778
3.414ProVal: 3.414 ± 0.477
0.759ProTrp: 0.759 ± 0.256
3.035ProTyr: 3.035 ± 0.371
0.0ProXaa: 0.0 ± 0.0
Gln
4.173GlnAla: 4.173 ± 0.429
1.138GlnCys: 1.138 ± 0.593
1.897GlnAsp: 1.897 ± 0.337
1.897GlnGlu: 1.897 ± 0.988
1.517GlnPhe: 1.517 ± 0.511
0.759GlnGly: 0.759 ± 0.256
0.759GlnHis: 0.759 ± 0.395
1.138GlnIle: 1.138 ± 0.709
0.759GlnLys: 0.759 ± 0.395
3.414GlnLeu: 3.414 ± 0.825
0.379GlnMet: 0.379 ± 0.265
1.138GlnAsn: 1.138 ± 0.058
2.276GlnPro: 2.276 ± 0.116
1.138GlnGln: 1.138 ± 0.058
1.897GlnArg: 1.897 ± 0.337
1.897GlnSer: 1.897 ± 0.337
1.897GlnThr: 1.897 ± 0.314
3.035GlnVal: 3.035 ± 0.371
1.897GlnTrp: 1.897 ± 0.314
0.759GlnTyr: 0.759 ± 0.256
0.0GlnXaa: 0.0 ± 0.0
Arg
4.552ArgAla: 4.552 ± 1.07
0.379ArgCys: 0.379 ± 0.453
2.276ArgAsp: 2.276 ± 0.535
2.656ArgGlu: 2.656 ± 0.733
3.414ArgPhe: 3.414 ± 0.825
2.656ArgGly: 2.656 ± 0.082
0.379ArgHis: 0.379 ± 0.198
2.276ArgIle: 2.276 ± 1.186
5.311ArgLys: 5.311 ± 2.768
5.69ArgLeu: 5.69 ± 0.361
1.138ArgMet: 1.138 ± 0.058
1.517ArgAsn: 1.517 ± 0.791
3.035ArgPro: 3.035 ± 0.931
1.897ArgGln: 1.897 ± 0.965
1.897ArgArg: 1.897 ± 0.337
5.311ArgSer: 5.311 ± 0.815
2.656ArgThr: 2.656 ± 1.871
4.173ArgVal: 4.173 ± 0.873
0.0ArgTrp: 0.0 ± 0.0
1.138ArgTyr: 1.138 ± 0.709
0.0ArgXaa: 0.0 ± 0.0
Ser
4.932SerAla: 4.932 ± 1.336
0.759SerCys: 0.759 ± 0.395
4.173SerAsp: 4.173 ± 0.222
3.414SerGlu: 3.414 ± 0.825
4.173SerPhe: 4.173 ± 0.873
6.449SerGly: 6.449 ± 1.196
1.517SerHis: 1.517 ± 0.791
2.656SerIle: 2.656 ± 0.082
4.552SerLys: 4.552 ± 0.232
5.311SerLeu: 5.311 ± 0.487
2.276SerMet: 2.276 ± 0.116
4.932SerAsn: 4.932 ± 1.987
2.656SerPro: 2.656 ± 0.733
3.035SerGln: 3.035 ± 1.022
3.794SerArg: 3.794 ± 0.675
4.552SerSer: 4.552 ± 0.232
3.414SerThr: 3.414 ± 2.127
7.587SerVal: 7.587 ± 1.254
0.759SerTrp: 0.759 ± 0.256
2.656SerTyr: 2.656 ± 0.569
0.0SerXaa: 0.0 ± 0.0
Thr
3.035ThrAla: 3.035 ± 0.28
0.379ThrCys: 0.379 ± 0.198
3.794ThrAsp: 3.794 ± 1.278
1.517ThrGlu: 1.517 ± 0.791
4.552ThrPhe: 4.552 ± 0.883
3.414ThrGly: 3.414 ± 2.778
0.759ThrHis: 0.759 ± 0.395
4.173ThrIle: 4.173 ± 1.08
1.517ThrLys: 1.517 ± 0.14
4.552ThrLeu: 4.552 ± 0.883
1.138ThrMet: 1.138 ± 0.709
3.035ThrAsn: 3.035 ± 0.28
3.035ThrPro: 3.035 ± 1.673
1.897ThrGln: 1.897 ± 0.337
2.276ThrArg: 2.276 ± 0.535
3.035ThrSer: 3.035 ± 0.371
3.414ThrThr: 3.414 ± 2.127
3.414ThrVal: 3.414 ± 1.476
0.0ThrTrp: 0.0 ± 0.0
3.035ThrTyr: 3.035 ± 0.371
0.0ThrXaa: 0.0 ± 0.0
Val
6.449ValAla: 6.449 ± 1.847
2.276ValCys: 2.276 ± 0.535
3.035ValAsp: 3.035 ± 0.371
5.311ValGlu: 5.311 ± 2.117
4.552ValPhe: 4.552 ± 1.534
3.794ValGly: 3.794 ± 0.627
2.276ValHis: 2.276 ± 0.535
2.656ValIle: 2.656 ± 0.569
4.552ValLys: 4.552 ± 0.232
7.587ValLeu: 7.587 ± 2.652
2.656ValMet: 2.656 ± 0.082
2.276ValAsn: 2.276 ± 0.116
6.07ValPro: 6.07 ± 2.696
1.897ValGln: 1.897 ± 0.337
3.035ValArg: 3.035 ± 0.28
6.07ValSer: 6.07 ± 2.696
3.035ValThr: 3.035 ± 0.931
5.311ValVal: 5.311 ± 0.164
0.759ValTrp: 0.759 ± 0.907
3.035ValTyr: 3.035 ± 0.28
0.0ValXaa: 0.0 ± 0.0
Trp
0.379TrpAla: 0.379 ± 0.198
1.138TrpCys: 1.138 ± 0.058
0.379TrpAsp: 0.379 ± 0.198
0.379TrpGlu: 0.379 ± 0.198
1.138TrpPhe: 1.138 ± 0.058
0.379TrpGly: 0.379 ± 0.453
0.759TrpHis: 0.759 ± 0.395
0.759TrpIle: 0.759 ± 0.395
0.759TrpLys: 0.759 ± 0.395
1.138TrpLeu: 1.138 ± 0.709
0.379TrpMet: 0.379 ± 0.198
3.035TrpAsn: 3.035 ± 0.28
0.0TrpPro: 0.0 ± 0.0
0.379TrpGln: 0.379 ± 0.453
1.138TrpArg: 1.138 ± 0.058
1.138TrpSer: 1.138 ± 0.709
0.759TrpThr: 0.759 ± 0.395
0.379TrpVal: 0.379 ± 0.453
0.379TrpTrp: 0.379 ± 0.453
1.138TrpTyr: 1.138 ± 0.593
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.794TyrAla: 3.794 ± 1.278
0.759TyrCys: 0.759 ± 0.256
3.414TyrAsp: 3.414 ± 1.128
0.759TyrGlu: 0.759 ± 0.395
4.173TyrPhe: 4.173 ± 1.08
2.276TyrGly: 2.276 ± 0.116
1.138TyrHis: 1.138 ± 0.709
1.897TyrIle: 1.897 ± 0.337
3.035TyrLys: 3.035 ± 0.28
3.414TyrLeu: 3.414 ± 0.174
0.759TyrMet: 0.759 ± 0.209
1.138TyrAsn: 1.138 ± 0.709
2.656TyrPro: 2.656 ± 0.733
1.138TyrGln: 1.138 ± 0.058
3.035TyrArg: 3.035 ± 0.28
2.276TyrSer: 2.276 ± 0.535
2.276TyrThr: 2.276 ± 0.535
1.517TyrVal: 1.517 ± 0.511
0.759TyrTrp: 0.759 ± 0.256
1.517TyrTyr: 1.517 ± 0.791
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2637 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
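One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resample, and report the spread of the resulting estimates. The sketch below illustrates that idea only. The exact procedure used for the table is not described here, so the function names, the fixed seed, and the use of the population standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def dipeptide_frequency_permille(proteins, dipeptide):
    """Frequency of one dipeptide (overlapping windows), in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Each round draws len(proteins) proteins at random (with replacement),
    recomputes the frequency, and the spread of the estimates is returned.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC"]  # toy sequences, not taken from the proteome
    print(bootstrap_error(toy, "AA"))
```

With only two proteins, each resample can contain just one of three compositions (first protein twice, second protein twice, or one of each), so the error estimate is necessarily coarse.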

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski