Amino acid dipeptide frequency for Wenzhou picorna-like virus 33

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each pair of adjacent amino acids.
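As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_permille` and the choice to map every non-standard letter to `X` (Xaa) are assumptions made for this example; the exact counting convention used by the database is not stated on this page.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping windows are counted within each protein, and any
    non-standard or unknown residue is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

if __name__ == "__main__":
    # Toy input, not taken from the proteome described here.
    for pair, value in sorted(dipeptide_permille(["MAAG", "AAB"]).items()):
        print(pair, round(value, 3))
```

Pairs are never counted across the boundary between two proteins in this sketch, because each sequence is scanned separately.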

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 amino acids equally likely, each dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
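The expected value and the over/under classification can be sketched as follows. The helper name `classify` and the labels it returns are hypothetical, and the simple greater-than/less-than comparison against the uniform expectation is an assumption; the page does not say whether the colouring uses any tolerance or statistical test.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)

# With Xaa as a 21st residue there are 21 * 21 = 441 possible pairs.
EXPECTED_PERMILLE_WITH_XAA = 1000.0 / ((N_STANDARD + 1) ** 2)

def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if observed_permille > expected:
        return "overrepresented"   # shown in red in the table
    if observed_permille < expected:
        return "underrepresented"  # shown in blue in the table
    return "as expected"

if __name__ == "__main__":
    # Values taken from the table on this page.
    print("AlaAla", classify(9.795))
    print("CysCys", classify(0.0))
```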

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
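For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The function names below are hypothetical and serve only to make the arithmetic explicit.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0

if __name__ == "__main__":
    ala_ala = 9.795  # AlaAla value from the table, in per mille
    print(permille_to_fraction(ala_ala))
    print(permille_to_percent(ala_ala))
```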

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.795AlaAla: 9.795 ± 0.322
1.866AlaCys: 1.866 ± 0.924
4.664AlaAsp: 4.664 ± 1.449
3.731AlaGlu: 3.731 ± 1.849
4.198AlaPhe: 4.198 ± 0.507
9.328AlaGly: 9.328 ± 1.415
2.332AlaHis: 2.332 ± 0.293
4.198AlaIle: 4.198 ± 2.08
4.664AlaLys: 4.664 ± 0.276
5.131AlaLeu: 5.131 ± 0.817
3.731AlaMet: 3.731 ± 1.849
4.198AlaAsn: 4.198 ± 2.233
7.929AlaPro: 7.929 ± 2.971
2.799AlaGln: 2.799 ± 2.063
5.597AlaArg: 5.597 ± 1.048
6.996AlaSer: 6.996 ± 1.708
1.866AlaThr: 1.866 ± 1.663
10.261AlaVal: 10.261 ± 1.816
1.399AlaTrp: 1.399 ± 1.894
1.399AlaTyr: 1.399 ± 0.693
0.0AlaXaa: 0.0 ± 0.0
Cys
1.866CysAla: 1.866 ± 0.062
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.933CysPhe: 0.933 ± 0.4
0.933CysGly: 0.933 ± 0.462
0.466CysHis: 0.466 ± 0.231
1.399CysIle: 1.399 ± 0.693
0.466CysLys: 0.466 ± 0.231
0.466CysLeu: 0.466 ± 0.231
0.466CysMet: 0.466 ± 0.231
0.466CysAsn: 0.466 ± 0.231
1.866CysPro: 1.866 ± 0.062
0.0CysGln: 0.0 ± 0.0
1.399CysArg: 1.399 ± 0.693
0.933CysSer: 0.933 ± 0.4
1.866CysThr: 1.866 ± 0.924
0.466CysVal: 0.466 ± 0.231
0.0CysTrp: 0.0 ± 0.0
0.933CysTyr: 0.933 ± 0.462
0.0CysXaa: 0.0 ± 0.0
Asp
2.799AspAla: 2.799 ± 1.201
1.399AspCys: 1.399 ± 0.693
4.664AspAsp: 4.664 ± 1.449
2.799AspGlu: 2.799 ± 0.524
2.799AspPhe: 2.799 ± 0.524
3.731AspGly: 3.731 ± 1.601
0.933AspHis: 0.933 ± 0.462
4.664AspIle: 4.664 ± 1.449
1.399AspLys: 1.399 ± 0.693
4.664AspLeu: 4.664 ± 1.139
1.866AspMet: 1.866 ± 0.062
2.332AspAsn: 2.332 ± 0.293
2.332AspPro: 2.332 ± 0.293
0.933AspGln: 0.933 ± 0.462
2.332AspArg: 2.332 ± 0.293
3.265AspSer: 3.265 ± 0.107
3.265AspThr: 3.265 ± 0.755
2.799AspVal: 2.799 ± 1.387
1.866AspTrp: 1.866 ± 0.062
2.799AspTyr: 2.799 ± 0.524
0.0AspXaa: 0.0 ± 0.0
Glu
6.063GluAla: 6.063 ± 1.28
0.933GluCys: 0.933 ± 0.462
1.866GluAsp: 1.866 ± 0.924
4.664GluGlu: 4.664 ± 2.311
2.799GluPhe: 2.799 ± 1.201
2.332GluGly: 2.332 ± 1.156
1.399GluHis: 1.399 ± 0.693
1.399GluIle: 1.399 ± 0.693
3.265GluLys: 3.265 ± 0.755
3.265GluLeu: 3.265 ± 1.618
1.866GluMet: 1.866 ± 0.062
1.399GluAsn: 1.399 ± 0.693
0.933GluPro: 0.933 ± 0.462
2.799GluGln: 2.799 ± 1.387
4.198GluArg: 4.198 ± 2.08
3.265GluSer: 3.265 ± 0.97
1.399GluThr: 1.399 ± 0.693
3.731GluVal: 3.731 ± 0.124
0.933GluTrp: 0.933 ± 0.462
1.399GluTyr: 1.399 ± 0.169
0.0GluXaa: 0.0 ± 0.0
Phe
6.063PheAla: 6.063 ± 2.171
0.466PheCys: 0.466 ± 0.231
2.799PheAsp: 2.799 ± 0.338
2.332PheGlu: 2.332 ± 0.569
0.466PhePhe: 0.466 ± 0.231
5.131PheGly: 5.131 ± 0.817
2.332PheHis: 2.332 ± 0.569
3.731PheIle: 3.731 ± 0.739
3.731PheLys: 3.731 ± 1.849
3.731PheLeu: 3.731 ± 0.986
0.466PheMet: 0.466 ± 0.548
0.933PheAsn: 0.933 ± 0.4
1.866PhePro: 1.866 ± 0.801
1.399PheGln: 1.399 ± 0.169
2.332PheArg: 2.332 ± 2.295
3.731PheSer: 3.731 ± 0.739
2.332PheThr: 2.332 ± 1.156
3.265PheVal: 3.265 ± 0.755
0.0PheTrp: 0.0 ± 0.0
1.866PheTyr: 1.866 ± 0.801
0.0PheXaa: 0.0 ± 0.0
Gly
8.396GlyAla: 8.396 ± 0.71
0.466GlyCys: 0.466 ± 0.231
6.063GlyAsp: 6.063 ± 1.308
4.664GlyGlu: 4.664 ± 0.276
3.731GlyPhe: 3.731 ± 0.739
6.996GlyGly: 6.996 ± 1.708
0.933GlyHis: 0.933 ± 0.462
6.063GlyIle: 6.063 ± 0.446
2.799GlyLys: 2.799 ± 1.387
3.265GlyLeu: 3.265 ± 0.97
0.933GlyMet: 0.933 ± 0.4
2.799GlyAsn: 2.799 ± 0.338
2.332GlyPro: 2.332 ± 2.295
0.933GlyGln: 0.933 ± 0.462
3.731GlyArg: 3.731 ± 0.124
5.131GlySer: 5.131 ± 1.77
3.731GlyThr: 3.731 ± 2.464
6.996GlyVal: 6.996 ± 1.708
1.399GlyTrp: 1.399 ± 0.693
1.866GlyTyr: 1.866 ± 0.924
0.0GlyXaa: 0.0 ± 0.0
His
1.399HisAla: 1.399 ± 1.032
0.466HisCys: 0.466 ± 0.231
0.933HisAsp: 0.933 ± 0.462
1.866HisGlu: 1.866 ± 0.924
1.399HisPhe: 1.399 ± 0.169
1.399HisGly: 1.399 ± 0.169
0.0HisHis: 0.0 ± 0.0
0.466HisIle: 0.466 ± 0.231
1.399HisLys: 1.399 ± 0.693
2.799HisLeu: 2.799 ± 1.387
0.933HisMet: 0.933 ± 1.263
1.866HisAsn: 1.866 ± 0.062
0.933HisPro: 0.933 ± 0.4
0.466HisGln: 0.466 ± 0.231
0.466HisArg: 0.466 ± 0.231
2.332HisSer: 2.332 ± 0.293
1.399HisThr: 1.399 ± 0.169
3.731HisVal: 3.731 ± 0.124
0.0HisTrp: 0.0 ± 0.0
0.466HisTyr: 0.466 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
6.996IleAla: 6.996 ± 0.879
0.933IleCys: 0.933 ± 0.462
3.731IleAsp: 3.731 ± 0.739
2.799IleGlu: 2.799 ± 0.524
2.332IlePhe: 2.332 ± 0.293
4.198IleGly: 4.198 ± 1.37
2.332IleHis: 2.332 ± 1.156
0.933IleIle: 0.933 ± 0.462
2.332IleLys: 2.332 ± 1.156
4.198IleLeu: 4.198 ± 1.218
3.265IleMet: 3.265 ± 0.97
1.399IleAsn: 1.399 ± 0.169
1.399IlePro: 1.399 ± 1.894
2.332IleGln: 2.332 ± 1.156
1.399IleArg: 1.399 ± 0.169
4.664IleSer: 4.664 ± 0.276
2.799IleThr: 2.799 ± 0.524
3.265IleVal: 3.265 ± 0.107
0.466IleTrp: 0.466 ± 0.631
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
5.597LysAla: 5.597 ± 1.048
0.466LysCys: 0.466 ± 0.231
3.265LysAsp: 3.265 ± 0.755
0.933LysGlu: 0.933 ± 0.462
2.332LysPhe: 2.332 ± 0.569
1.866LysGly: 1.866 ± 0.924
0.933LysHis: 0.933 ± 0.462
1.866LysIle: 1.866 ± 0.924
2.799LysLys: 2.799 ± 1.387
3.265LysLeu: 3.265 ± 1.618
1.399LysMet: 1.399 ± 0.693
0.933LysAsn: 0.933 ± 0.462
3.731LysPro: 3.731 ± 0.739
1.399LysGln: 1.399 ± 0.169
2.799LysArg: 2.799 ± 1.387
2.799LysSer: 2.799 ± 0.524
4.664LysThr: 4.664 ± 1.449
1.866LysVal: 1.866 ± 0.062
0.0LysTrp: 0.0 ± 0.0
0.933LysTyr: 0.933 ± 0.462
0.0LysXaa: 0.0 ± 0.0
Leu
8.862LeuAla: 8.862 ± 0.784
0.0LeuCys: 0.0 ± 0.0
5.131LeuAsp: 5.131 ± 2.542
4.664LeuGlu: 4.664 ± 2.311
3.265LeuPhe: 3.265 ± 0.755
1.866LeuGly: 1.866 ± 0.062
3.265LeuHis: 3.265 ± 0.107
3.265LeuIle: 3.265 ± 1.618
4.198LeuLys: 4.198 ± 1.218
5.131LeuLeu: 5.131 ± 0.045
3.265LeuMet: 3.265 ± 0.755
1.866LeuAsn: 1.866 ± 0.924
3.265LeuPro: 3.265 ± 0.107
1.866LeuGln: 1.866 ± 0.062
6.063LeuArg: 6.063 ± 2.142
4.664LeuSer: 4.664 ± 1.139
5.131LeuThr: 5.131 ± 0.817
6.063LeuVal: 6.063 ± 0.417
1.866LeuTrp: 1.866 ± 1.663
2.799LeuTyr: 2.799 ± 0.338
0.0LeuXaa: 0.0 ± 0.0
Met
2.332MetAla: 2.332 ± 1.156
1.399MetCys: 1.399 ± 0.693
1.399MetAsp: 1.399 ± 0.169
1.866MetGlu: 1.866 ± 0.924
2.332MetPhe: 2.332 ± 0.293
1.399MetGly: 1.399 ± 0.169
0.933MetHis: 0.933 ± 0.462
0.466MetIle: 0.466 ± 0.231
1.399MetLys: 1.399 ± 0.693
2.332MetLeu: 2.332 ± 1.432
0.466MetMet: 0.466 ± 0.231
0.466MetAsn: 0.466 ± 0.231
1.866MetPro: 1.866 ± 0.801
1.866MetGln: 1.866 ± 0.801
1.866MetArg: 1.866 ± 0.062
3.731MetSer: 3.731 ± 0.739
0.933MetThr: 0.933 ± 0.462
2.332MetVal: 2.332 ± 0.569
0.933MetTrp: 0.933 ± 0.4
1.399MetTyr: 1.399 ± 0.693
0.0MetXaa: 0.0 ± 0.0
Asn
1.866AsnAla: 1.866 ± 0.801
0.0AsnCys: 0.0 ± 0.0
1.399AsnAsp: 1.399 ± 0.169
0.933AsnGlu: 0.933 ± 0.462
1.866AsnPhe: 1.866 ± 0.924
3.731AsnGly: 3.731 ± 0.739
0.466AsnHis: 0.466 ± 0.231
2.332AsnIle: 2.332 ± 0.569
0.466AsnLys: 0.466 ± 0.231
2.332AsnLeu: 2.332 ± 0.293
1.399AsnMet: 1.399 ± 0.229
2.332AsnAsn: 2.332 ± 2.295
0.933AsnPro: 0.933 ± 0.462
0.933AsnGln: 0.933 ± 0.462
0.933AsnArg: 0.933 ± 0.4
2.332AsnSer: 2.332 ± 0.293
3.731AsnThr: 3.731 ± 1.601
4.198AsnVal: 4.198 ± 0.507
0.933AsnTrp: 0.933 ± 0.4
2.332AsnTyr: 2.332 ± 0.569
0.0AsnXaa: 0.0 ± 0.0
Pro
3.265ProAla: 3.265 ± 1.832
1.866ProCys: 1.866 ± 0.801
1.399ProAsp: 1.399 ± 0.169
3.265ProGlu: 3.265 ± 1.618
0.933ProPhe: 0.933 ± 0.4
3.731ProGly: 3.731 ± 2.464
1.399ProHis: 1.399 ± 0.169
2.799ProIle: 2.799 ± 2.063
0.933ProLys: 0.933 ± 0.4
4.198ProLeu: 4.198 ± 0.507
1.866ProMet: 1.866 ± 1.663
1.399ProAsn: 1.399 ± 1.894
2.332ProPro: 2.332 ± 0.293
2.332ProGln: 2.332 ± 1.156
1.866ProArg: 1.866 ± 0.801
4.198ProSer: 4.198 ± 1.37
4.664ProThr: 4.664 ± 1.139
4.198ProVal: 4.198 ± 1.37
0.933ProTrp: 0.933 ± 0.462
0.933ProTyr: 0.933 ± 0.4
0.0ProXaa: 0.0 ± 0.0
Gln
3.265GlnAla: 3.265 ± 1.618
0.933GlnCys: 0.933 ± 0.4
0.466GlnAsp: 0.466 ± 0.231
0.933GlnGlu: 0.933 ± 0.462
0.466GlnPhe: 0.466 ± 0.231
0.933GlnGly: 0.933 ± 0.4
0.933GlnHis: 0.933 ± 1.263
2.332GlnIle: 2.332 ± 0.569
1.399GlnLys: 1.399 ± 0.169
2.799GlnLeu: 2.799 ± 0.524
0.933GlnMet: 0.933 ± 0.462
0.933GlnAsn: 0.933 ± 0.462
2.799GlnPro: 2.799 ± 0.524
0.466GlnGln: 0.466 ± 0.231
1.866GlnArg: 1.866 ± 0.801
3.731GlnSer: 3.731 ± 0.739
0.933GlnThr: 0.933 ± 0.4
4.198GlnVal: 4.198 ± 2.08
0.466GlnTrp: 0.466 ± 0.231
2.332GlnTyr: 2.332 ± 0.569
0.0GlnXaa: 0.0 ± 0.0
Arg
4.664ArgAla: 4.664 ± 0.586
0.933ArgCys: 0.933 ± 0.4
3.731ArgAsp: 3.731 ± 0.986
3.265ArgGlu: 3.265 ± 1.618
3.731ArgPhe: 3.731 ± 0.124
3.731ArgGly: 3.731 ± 1.601
1.399ArgHis: 1.399 ± 0.693
0.933ArgIle: 0.933 ± 0.4
1.866ArgLys: 1.866 ± 0.924
4.664ArgLeu: 4.664 ± 0.276
0.933ArgMet: 0.933 ± 0.462
1.399ArgAsn: 1.399 ± 0.693
2.332ArgPro: 2.332 ± 0.293
2.799ArgGln: 2.799 ± 0.524
4.198ArgArg: 4.198 ± 1.218
2.799ArgSer: 2.799 ± 0.524
4.664ArgThr: 4.664 ± 1.449
5.131ArgVal: 5.131 ± 1.77
0.0ArgTrp: 0.0 ± 0.0
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.664SerAla: 4.664 ± 1.139
0.466SerCys: 0.466 ± 0.231
2.332SerAsp: 2.332 ± 0.293
3.265SerGlu: 3.265 ± 0.107
4.664SerPhe: 4.664 ± 1.139
7.929SerGly: 7.929 ± 1.341
2.332SerHis: 2.332 ± 0.569
3.731SerIle: 3.731 ± 0.124
3.265SerLys: 3.265 ± 1.832
8.396SerLeu: 8.396 ± 1.573
1.399SerMet: 1.399 ± 0.693
1.399SerAsn: 1.399 ± 1.032
2.799SerPro: 2.799 ± 2.926
3.265SerGln: 3.265 ± 0.97
3.265SerArg: 3.265 ± 0.107
5.597SerSer: 5.597 ± 0.677
6.53SerThr: 6.53 ± 3.665
7.463SerVal: 7.463 ± 0.248
0.933SerTrp: 0.933 ± 1.263
2.799SerTyr: 2.799 ± 0.524
0.0SerXaa: 0.0 ± 0.0
Thr
6.063ThrAla: 6.063 ± 2.171
0.933ThrCys: 0.933 ± 0.462
2.799ThrAsp: 2.799 ± 1.201
1.399ThrGlu: 1.399 ± 0.169
3.265ThrPhe: 3.265 ± 0.97
2.799ThrGly: 2.799 ± 1.201
0.466ThrHis: 0.466 ± 0.631
5.131ThrIle: 5.131 ± 0.908
2.332ThrLys: 2.332 ± 1.156
5.131ThrLeu: 5.131 ± 1.68
2.332ThrMet: 2.332 ± 0.569
3.731ThrAsn: 3.731 ± 0.124
1.866ThrPro: 1.866 ± 0.924
2.332ThrGln: 2.332 ± 0.293
2.332ThrArg: 2.332 ± 0.569
5.597ThrSer: 5.597 ± 1.048
6.063ThrThr: 6.063 ± 2.171
3.265ThrVal: 3.265 ± 0.97
0.466ThrTrp: 0.466 ± 0.231
2.799ThrTyr: 2.799 ± 0.338
0.0ThrXaa: 0.0 ± 0.0
Val
9.328ValAla: 9.328 ± 1.415
0.466ValCys: 0.466 ± 0.231
4.198ValAsp: 4.198 ± 0.507
4.664ValGlu: 4.664 ± 0.586
4.198ValPhe: 4.198 ± 0.355
8.396ValGly: 8.396 ± 1.015
1.399ValHis: 1.399 ± 0.693
2.332ValIle: 2.332 ± 0.293
2.332ValLys: 2.332 ± 0.293
6.996ValLeu: 6.996 ± 0.879
3.265ValMet: 3.265 ± 0.755
2.799ValAsn: 2.799 ± 0.338
4.198ValPro: 4.198 ± 3.095
2.799ValGln: 2.799 ± 0.338
4.664ValArg: 4.664 ± 1.449
5.131ValSer: 5.131 ± 0.908
3.731ValThr: 3.731 ± 1.601
5.597ValVal: 5.597 ± 1.048
1.399ValTrp: 1.399 ± 0.693
2.799ValTyr: 2.799 ± 0.524
0.0ValXaa: 0.0 ± 0.0
Trp
0.933TrpAla: 0.933 ± 0.4
0.466TrpCys: 0.466 ± 0.231
0.933TrpAsp: 0.933 ± 0.462
0.466TrpGlu: 0.466 ± 0.231
1.399TrpPhe: 1.399 ± 1.032
1.399TrpGly: 1.399 ± 1.032
0.466TrpHis: 0.466 ± 0.631
0.933TrpIle: 0.933 ± 0.4
0.0TrpLys: 0.0 ± 0.0
1.399TrpLeu: 1.399 ± 0.693
0.0TrpMet: 0.0 ± 0.0
1.399TrpAsn: 1.399 ± 0.693
1.866TrpPro: 1.866 ± 0.801
0.466TrpGln: 0.466 ± 0.231
0.466TrpArg: 0.466 ± 0.231
1.866TrpSer: 1.866 ± 1.663
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.466TrpTyr: 0.466 ± 0.231
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.866TyrAla: 1.866 ± 0.062
0.466TyrCys: 0.466 ± 0.231
1.866TyrAsp: 1.866 ± 0.924
1.399TyrGlu: 1.399 ± 0.169
2.332TyrPhe: 2.332 ± 0.569
1.399TyrGly: 1.399 ± 0.693
0.0TyrHis: 0.0 ± 0.0
2.799TyrIle: 2.799 ± 1.201
2.799TyrLys: 2.799 ± 1.387
2.332TyrLeu: 2.332 ± 0.293
0.466TyrMet: 0.466 ± 0.231
1.399TyrAsn: 1.399 ± 1.032
0.933TyrPro: 0.933 ± 0.4
0.933TyrGln: 0.933 ± 0.4
1.399TyrArg: 1.399 ± 0.169
3.731TyrSer: 3.731 ± 0.124
1.399TyrThr: 1.399 ± 0.693
1.866TyrVal: 1.866 ± 0.924
0.933TyrTrp: 0.933 ± 0.462
0.933TyrTyr: 0.933 ± 0.462
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2145 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
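One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency on each resample, and take the spread of the results as the error. The sketch below follows that reading; the function names, the fixed seed, and the use of the population standard deviation are assumptions, not the database's documented procedure.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping windows."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed (assumption) for reproducibility
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

if __name__ == "__main__":
    # Toy sequences, not the two proteins of this proteome.
    toy = ["MAAGAA", "MKKLAA"]
    print(round(bootstrap_error(toy, "AA"), 3))
```

With only two proteins, as in this proteome, each resample can take only a few distinct compositions, so the resulting error estimates are coarse.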

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski