Amino acid dipepetide frequency for Wenzhou picorna-like virus 53

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.535AlaAla: 5.535 ± 0.311
0.369AlaCys: 0.369 ± 0.443
3.69AlaAsp: 3.69 ± 0.003
4.428AlaGlu: 4.428 ± 1.638
3.69AlaPhe: 3.69 ± 1.26
5.535AlaGly: 5.535 ± 2.216
1.845AlaHis: 1.845 ± 0.314
2.583AlaIle: 2.583 ± 0.061
2.583AlaLys: 2.583 ± 0.571
9.594AlaLeu: 9.594 ± 1.399
2.214AlaMet: 2.214 ± 0.503
2.952AlaAsn: 2.952 ± 2.277
2.214AlaPro: 2.214 ± 0.503
1.107AlaGln: 1.107 ± 0.696
2.583AlaArg: 2.583 ± 0.692
4.428AlaSer: 4.428 ± 2.152
4.059AlaThr: 4.059 ± 0.446
4.797AlaVal: 4.797 ± 0.699
0.738AlaTrp: 0.738 ± 0.253
2.583AlaTyr: 2.583 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
1.845CysAla: 1.845 ± 0.946
0.738CysCys: 0.738 ± 0.378
1.107CysAsp: 1.107 ± 0.064
0.738CysGlu: 0.738 ± 0.378
0.738CysPhe: 0.738 ± 0.885
2.214CysGly: 2.214 ± 0.503
0.0CysHis: 0.0 ± 0.0
0.738CysIle: 0.738 ± 0.253
1.476CysLys: 1.476 ± 0.125
1.107CysLeu: 1.107 ± 0.567
0.0CysMet: 0.0 ± 0.0
1.107CysAsn: 1.107 ± 0.064
0.369CysPro: 0.369 ± 0.189
0.369CysGln: 0.369 ± 0.189
0.0CysArg: 0.0 ± 0.0
0.738CysSer: 0.738 ± 0.378
0.369CysThr: 0.369 ± 0.189
1.476CysVal: 1.476 ± 0.125
0.738CysTrp: 0.738 ± 0.885
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.166AspAla: 5.166 ± 1.385
0.369AspCys: 0.369 ± 0.443
5.535AspAsp: 5.535 ± 0.311
7.38AspGlu: 7.38 ± 0.639
4.059AspPhe: 4.059 ± 1.078
2.952AspGly: 2.952 ± 0.882
0.369AspHis: 0.369 ± 0.189
1.845AspIle: 1.845 ± 0.949
2.214AspLys: 2.214 ± 1.135
4.059AspLeu: 4.059 ± 0.446
1.107AspMet: 1.107 ± 0.567
2.583AspAsn: 2.583 ± 0.571
3.69AspPro: 3.69 ± 2.53
1.476AspGln: 1.476 ± 0.125
1.107AspArg: 1.107 ± 0.064
2.583AspSer: 2.583 ± 0.571
2.583AspThr: 2.583 ± 0.692
5.166AspVal: 5.166 ± 2.016
2.214AspTrp: 2.214 ± 0.128
2.214AspTyr: 2.214 ± 1.135
0.0AspXaa: 0.0 ± 0.0
Glu
2.952GluAla: 2.952 ± 0.382
1.107GluCys: 1.107 ± 0.567
1.107GluAsp: 1.107 ± 0.064
4.428GluGlu: 4.428 ± 1.638
2.952GluPhe: 2.952 ± 1.513
4.059GluGly: 4.059 ± 1.449
1.107GluHis: 1.107 ± 0.567
5.904GluIle: 5.904 ± 1.131
5.535GluLys: 5.535 ± 2.206
5.904GluLeu: 5.904 ± 1.395
2.214GluMet: 2.214 ± 0.128
1.845GluAsn: 1.845 ± 0.314
2.214GluPro: 2.214 ± 0.128
1.476GluGln: 1.476 ± 0.757
2.214GluArg: 2.214 ± 0.503
4.059GluSer: 4.059 ± 0.817
2.952GluThr: 2.952 ± 0.882
4.059GluVal: 4.059 ± 1.078
1.107GluTrp: 1.107 ± 0.064
2.952GluTyr: 2.952 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
3.69PheAla: 3.69 ± 1.267
0.0PheCys: 0.0 ± 0.0
3.321PheAsp: 3.321 ± 0.193
2.583PheGlu: 2.583 ± 1.324
2.214PhePhe: 2.214 ± 0.76
3.69PheGly: 3.69 ± 0.628
1.476PheHis: 1.476 ± 0.125
1.845PheIle: 1.845 ± 0.314
1.107PheLys: 1.107 ± 0.567
4.797PheLeu: 4.797 ± 0.564
1.845PheMet: 1.845 ± 0.318
3.69PheAsn: 3.69 ± 2.53
1.845PhePro: 1.845 ± 0.946
2.952PheGln: 2.952 ± 0.882
2.583PheArg: 2.583 ± 0.061
5.166PheSer: 5.166 ± 0.121
1.845PheThr: 1.845 ± 0.314
2.583PheVal: 2.583 ± 0.692
0.738PheTrp: 0.738 ± 0.253
1.107PheTyr: 1.107 ± 0.696
0.0PheXaa: 0.0 ± 0.0
Gly
5.535GlyAla: 5.535 ± 0.321
0.738GlyCys: 0.738 ± 0.253
5.535GlyAsp: 5.535 ± 0.311
3.321GlyGlu: 3.321 ± 1.456
2.583GlyPhe: 2.583 ± 0.061
4.059GlyGly: 4.059 ± 0.446
2.214GlyHis: 2.214 ± 0.503
4.059GlyIle: 4.059 ± 0.817
2.952GlyLys: 2.952 ± 0.382
4.059GlyLeu: 4.059 ± 0.446
1.845GlyMet: 1.845 ± 0.318
3.321GlyAsn: 3.321 ± 0.193
2.952GlyPro: 2.952 ± 0.382
4.797GlyGln: 4.797 ± 1.196
2.952GlyArg: 2.952 ± 0.382
4.059GlySer: 4.059 ± 0.186
2.952GlyThr: 2.952 ± 1.013
5.535GlyVal: 5.535 ± 0.311
1.107GlyTrp: 1.107 ± 0.696
2.214GlyTyr: 2.214 ± 0.76
0.0GlyXaa: 0.0 ± 0.0
His
1.107HisAla: 1.107 ± 0.064
1.107HisCys: 1.107 ± 0.567
1.476HisAsp: 1.476 ± 0.125
2.583HisGlu: 2.583 ± 0.692
1.476HisPhe: 1.476 ± 0.757
1.845HisGly: 1.845 ± 0.949
1.107HisHis: 1.107 ± 0.064
1.107HisIle: 1.107 ± 0.064
1.107HisLys: 1.107 ± 0.064
1.476HisLeu: 1.476 ± 0.757
0.369HisMet: 0.369 ± 0.189
1.476HisAsn: 1.476 ± 0.507
2.952HisPro: 2.952 ± 0.25
1.107HisGln: 1.107 ± 0.567
1.476HisArg: 1.476 ± 0.125
1.107HisSer: 1.107 ± 0.567
1.845HisThr: 1.845 ± 0.314
2.952HisVal: 2.952 ± 0.882
0.369HisTrp: 0.369 ± 0.189
1.476HisTyr: 1.476 ± 0.125
0.0HisXaa: 0.0 ± 0.0
Ile
3.69IleAla: 3.69 ± 0.628
0.738IleCys: 0.738 ± 0.253
2.952IleAsp: 2.952 ± 0.382
3.69IleGlu: 3.69 ± 0.003
2.214IlePhe: 2.214 ± 0.128
4.797IleGly: 4.797 ± 0.564
1.107IleHis: 1.107 ± 0.064
2.583IleIle: 2.583 ± 0.571
1.845IleLys: 1.845 ± 0.314
3.321IleLeu: 3.321 ± 1.456
1.107IleMet: 1.107 ± 0.567
2.583IleAsn: 2.583 ± 0.692
4.059IlePro: 4.059 ± 0.186
1.845IleGln: 1.845 ± 0.318
5.904IleArg: 5.904 ± 2.395
3.321IleSer: 3.321 ± 2.088
2.583IleThr: 2.583 ± 1.324
3.69IleVal: 3.69 ± 0.628
0.369IleTrp: 0.369 ± 0.189
1.476IleTyr: 1.476 ± 0.757
0.0IleXaa: 0.0 ± 0.0
Lys
2.952LysAla: 2.952 ± 1.513
1.107LysCys: 1.107 ± 0.567
2.583LysAsp: 2.583 ± 1.324
2.583LysGlu: 2.583 ± 1.324
2.952LysPhe: 2.952 ± 0.882
1.476LysGly: 1.476 ± 0.125
1.476LysHis: 1.476 ± 0.757
4.428LysIle: 4.428 ± 1.006
2.583LysLys: 2.583 ± 0.692
3.69LysLeu: 3.69 ± 0.635
0.369LysMet: 0.369 ± 0.189
1.476LysAsn: 1.476 ± 0.757
1.845LysPro: 1.845 ± 0.318
1.476LysGln: 1.476 ± 0.125
3.69LysArg: 3.69 ± 0.635
1.845LysSer: 1.845 ± 0.946
1.476LysThr: 1.476 ± 0.507
2.952LysVal: 2.952 ± 0.382
0.738LysTrp: 0.738 ± 0.378
2.214LysTyr: 2.214 ± 0.503
0.0LysXaa: 0.0 ± 0.0
Leu
5.904LeuAla: 5.904 ± 0.5
1.107LeuCys: 1.107 ± 0.567
5.535LeuAsp: 5.535 ± 0.311
4.428LeuGlu: 4.428 ± 0.375
2.952LeuPhe: 2.952 ± 0.382
4.428LeuGly: 4.428 ± 1.52
5.166LeuHis: 5.166 ± 1.142
4.428LeuIle: 4.428 ± 1.638
5.535LeuLys: 5.535 ± 0.311
8.487LeuLeu: 8.487 ± 1.192
2.583LeuMet: 2.583 ± 0.061
4.797LeuAsn: 4.797 ± 0.564
5.904LeuPro: 5.904 ± 1.395
4.059LeuGln: 4.059 ± 0.817
2.952LeuArg: 2.952 ± 0.882
8.856LeuSer: 8.856 ± 0.514
5.166LeuThr: 5.166 ± 2.405
5.535LeuVal: 5.535 ± 0.942
0.369LeuTrp: 0.369 ± 0.189
2.583LeuTyr: 2.583 ± 0.061
0.0LeuXaa: 0.0 ± 0.0
Met
0.738MetAla: 0.738 ± 0.885
0.369MetCys: 0.369 ± 0.189
1.476MetAsp: 1.476 ± 0.507
1.107MetGlu: 1.107 ± 0.064
1.845MetPhe: 1.845 ± 0.318
1.476MetGly: 1.476 ± 0.125
1.107MetHis: 1.107 ± 0.567
1.107MetIle: 1.107 ± 0.567
1.107MetLys: 1.107 ± 0.064
3.69MetLeu: 3.69 ± 1.891
0.738MetMet: 0.738 ± 0.378
0.369MetAsn: 0.369 ± 0.189
0.738MetPro: 0.738 ± 0.253
0.0MetGln: 0.0 ± 0.0
2.583MetArg: 2.583 ± 1.203
0.738MetSer: 0.738 ± 0.378
1.845MetThr: 1.845 ± 0.946
2.952MetVal: 2.952 ± 0.25
0.0MetTrp: 0.0 ± 0.0
0.738MetTyr: 0.738 ± 0.378
0.0MetXaa: 0.0 ± 0.0
Asn
2.952AsnAla: 2.952 ± 0.25
0.738AsnCys: 0.738 ± 0.378
1.107AsnAsp: 1.107 ± 0.567
1.845AsnGlu: 1.845 ± 0.318
3.321AsnPhe: 3.321 ± 1.456
5.535AsnGly: 5.535 ± 0.321
1.476AsnHis: 1.476 ± 0.125
2.214AsnIle: 2.214 ± 0.76
1.107AsnLys: 1.107 ± 0.064
6.642AsnLeu: 6.642 ± 0.385
1.107AsnMet: 1.107 ± 0.567
0.369AsnAsn: 0.369 ± 0.189
5.166AsnPro: 5.166 ± 1.142
1.845AsnGln: 1.845 ± 0.314
2.214AsnArg: 2.214 ± 0.128
3.69AsnSer: 3.69 ± 0.635
3.321AsnThr: 3.321 ± 0.439
3.69AsnVal: 3.69 ± 0.635
0.738AsnTrp: 0.738 ± 0.378
1.107AsnTyr: 1.107 ± 0.064
0.0AsnXaa: 0.0 ± 0.0
Pro
2.952ProAla: 2.952 ± 2.277
1.107ProCys: 1.107 ± 0.064
2.952ProAsp: 2.952 ± 1.013
2.583ProGlu: 2.583 ± 0.692
1.476ProPhe: 1.476 ± 0.507
0.738ProGly: 0.738 ± 0.885
1.107ProHis: 1.107 ± 0.696
4.428ProIle: 4.428 ± 0.375
1.476ProLys: 1.476 ± 0.757
5.904ProLeu: 5.904 ± 1.763
1.476ProMet: 1.476 ± 0.125
2.583ProAsn: 2.583 ± 1.324
1.476ProPro: 1.476 ± 0.125
4.059ProGln: 4.059 ± 1.078
2.214ProArg: 2.214 ± 0.128
3.69ProSer: 3.69 ± 1.267
2.583ProThr: 2.583 ± 0.571
4.797ProVal: 4.797 ± 1.963
0.738ProTrp: 0.738 ± 0.885
1.845ProTyr: 1.845 ± 0.949
0.0ProXaa: 0.0 ± 0.0
Gln
4.059GlnAla: 4.059 ± 2.341
0.369GlnCys: 0.369 ± 0.443
1.107GlnAsp: 1.107 ± 0.064
1.845GlnGlu: 1.845 ± 0.318
1.845GlnPhe: 1.845 ± 0.314
1.476GlnGly: 1.476 ± 1.138
2.214GlnHis: 2.214 ± 0.503
2.214GlnIle: 2.214 ± 0.76
1.845GlnLys: 1.845 ± 0.946
2.952GlnLeu: 2.952 ± 0.882
0.738GlnMet: 0.738 ± 0.378
1.476GlnAsn: 1.476 ± 0.125
1.476GlnPro: 1.476 ± 0.125
0.738GlnGln: 0.738 ± 0.378
1.845GlnArg: 1.845 ± 0.314
4.428GlnSer: 4.428 ± 1.006
2.214GlnThr: 2.214 ± 0.503
2.583GlnVal: 2.583 ± 0.061
0.369GlnTrp: 0.369 ± 0.189
0.369GlnTyr: 0.369 ± 0.189
0.0GlnXaa: 0.0 ± 0.0
Arg
1.845ArgAla: 1.845 ± 1.581
1.476ArgCys: 1.476 ± 0.757
1.845ArgAsp: 1.845 ± 0.314
1.845ArgGlu: 1.845 ± 0.318
2.583ArgPhe: 2.583 ± 0.061
4.059ArgGly: 4.059 ± 0.446
1.107ArgHis: 1.107 ± 0.567
2.214ArgIle: 2.214 ± 0.128
1.476ArgLys: 1.476 ± 0.757
4.428ArgLeu: 4.428 ± 0.375
1.476ArgMet: 1.476 ± 0.507
3.69ArgAsn: 3.69 ± 0.003
2.583ArgPro: 2.583 ± 0.061
2.214ArgGln: 2.214 ± 0.76
3.321ArgArg: 3.321 ± 1.071
3.69ArgSer: 3.69 ± 0.628
2.952ArgThr: 2.952 ± 0.382
4.059ArgVal: 4.059 ± 1.449
1.845ArgTrp: 1.845 ± 0.314
2.214ArgTyr: 2.214 ± 0.128
0.0ArgXaa: 0.0 ± 0.0
Ser
3.69SerAla: 3.69 ± 1.26
1.107SerCys: 1.107 ± 0.064
4.428SerAsp: 4.428 ± 0.375
3.69SerGlu: 3.69 ± 0.628
2.214SerPhe: 2.214 ± 0.76
5.535SerGly: 5.535 ± 0.953
1.476SerHis: 1.476 ± 0.757
4.428SerIle: 4.428 ± 0.257
4.428SerLys: 4.428 ± 1.006
4.797SerLeu: 4.797 ± 0.699
1.845SerMet: 1.845 ± 0.166
4.797SerAsn: 4.797 ± 1.963
2.214SerPro: 2.214 ± 0.76
2.214SerGln: 2.214 ± 0.128
2.952SerArg: 2.952 ± 0.382
4.059SerSer: 4.059 ± 2.341
4.797SerThr: 4.797 ± 3.226
4.428SerVal: 4.428 ± 2.27
0.738SerTrp: 0.738 ± 0.885
1.845SerTyr: 1.845 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
2.952ThrAla: 2.952 ± 1.645
0.369ThrCys: 0.369 ± 0.189
3.69ThrAsp: 3.69 ± 0.635
3.69ThrGlu: 3.69 ± 0.628
3.321ThrPhe: 3.321 ± 0.193
4.428ThrGly: 4.428 ± 0.375
1.845ThrHis: 1.845 ± 0.314
2.214ThrIle: 2.214 ± 0.76
2.214ThrLys: 2.214 ± 0.503
5.166ThrLeu: 5.166 ± 1.142
1.107ThrMet: 1.107 ± 0.064
3.69ThrAsn: 3.69 ± 0.628
3.321ThrPro: 3.321 ± 1.456
1.845ThrGln: 1.845 ± 0.949
2.583ThrArg: 2.583 ± 1.324
2.952ThrSer: 2.952 ± 1.013
5.166ThrThr: 5.166 ± 0.51
4.428ThrVal: 4.428 ± 0.257
1.476ThrTrp: 1.476 ± 0.125
2.583ThrTyr: 2.583 ± 1.834
0.0ThrXaa: 0.0 ± 0.0
Val
6.642ValAla: 6.642 ± 1.017
1.107ValCys: 1.107 ± 0.567
5.535ValAsp: 5.535 ± 0.311
3.321ValGlu: 3.321 ± 1.071
3.69ValPhe: 3.69 ± 1.26
7.011ValGly: 7.011 ± 1.067
1.476ValHis: 1.476 ± 0.757
4.059ValIle: 4.059 ± 0.817
1.476ValLys: 1.476 ± 0.125
6.273ValLeu: 6.273 ± 0.689
0.369ValMet: 0.369 ± 0.189
5.166ValAsn: 5.166 ± 0.121
3.321ValPro: 3.321 ± 0.439
1.107ValGln: 1.107 ± 0.064
4.428ValArg: 4.428 ± 0.257
4.797ValSer: 4.797 ± 0.699
6.273ValThr: 6.273 ± 1.206
6.642ValVal: 6.642 ± 0.246
1.107ValTrp: 1.107 ± 0.696
2.214ValTyr: 2.214 ± 0.128
0.0ValXaa: 0.0 ± 0.0
Trp
1.107TrpAla: 1.107 ± 0.064
0.369TrpCys: 0.369 ± 0.443
1.476TrpAsp: 1.476 ± 0.125
0.738TrpGlu: 0.738 ± 0.378
0.738TrpPhe: 0.738 ± 0.378
1.107TrpGly: 1.107 ± 0.696
0.738TrpHis: 0.738 ± 0.378
0.738TrpIle: 0.738 ± 0.378
0.369TrpLys: 0.369 ± 0.189
1.845TrpLeu: 1.845 ± 0.318
1.107TrpMet: 1.107 ± 0.211
0.369TrpAsn: 0.369 ± 0.443
0.369TrpPro: 0.369 ± 0.443
0.369TrpGln: 0.369 ± 0.443
0.738TrpArg: 0.738 ± 0.885
0.738TrpSer: 0.738 ± 0.253
1.845TrpThr: 1.845 ± 1.581
1.107TrpVal: 1.107 ± 0.064
0.738TrpTrp: 0.738 ± 0.253
0.369TrpTyr: 0.369 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.214TyrAla: 2.214 ± 0.76
1.476TyrCys: 1.476 ± 0.507
2.214TyrAsp: 2.214 ± 0.128
3.321TyrGlu: 3.321 ± 1.071
2.214TyrPhe: 2.214 ± 0.503
0.369TyrGly: 0.369 ± 0.189
1.107TyrHis: 1.107 ± 0.064
0.738TyrIle: 0.738 ± 0.378
1.476TyrLys: 1.476 ± 0.125
2.583TyrLeu: 2.583 ± 0.692
0.738TyrMet: 0.738 ± 0.378
1.845TyrAsn: 1.845 ± 0.946
1.476TyrPro: 1.476 ± 1.138
1.107TyrGln: 1.107 ± 0.064
2.583TyrArg: 2.583 ± 1.203
1.107TyrSer: 1.107 ± 0.064
2.214TyrThr: 2.214 ± 0.128
2.583TyrVal: 2.583 ± 1.203
0.738TyrTrp: 0.738 ± 0.885
1.845TyrTyr: 1.845 ± 0.318
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2711 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski