Amino acid dipepetide frequency for Changjiang picorna-like virus 12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.374AlaAla: 5.374 ± 2.308
0.384AlaCys: 0.384 ± 0.46
3.455AlaAsp: 3.455 ± 1.386
4.607AlaGlu: 4.607 ± 0.012
1.536AlaPhe: 1.536 ± 1.151
4.607AlaGly: 4.607 ± 1.389
0.384AlaHis: 0.384 ± 0.228
2.687AlaIle: 2.687 ± 0.222
3.071AlaLys: 3.071 ± 0.451
6.91AlaLeu: 6.91 ± 2.046
0.768AlaMet: 0.768 ± 0.457
4.99AlaAsn: 4.99 ± 1.16
3.839AlaPro: 3.839 ± 1.157
3.071AlaGln: 3.071 ± 0.451
2.687AlaArg: 2.687 ± 0.911
6.526AlaSer: 6.526 ± 0.935
4.223AlaThr: 4.223 ± 3.682
5.758AlaVal: 5.758 ± 1.392
0.768AlaTrp: 0.768 ± 0.231
6.142AlaTyr: 6.142 ± 0.901
0.0AlaXaa: 0.0 ± 0.0
Cys
2.687CysAla: 2.687 ± 0.911
0.384CysCys: 0.384 ± 0.228
1.152CysAsp: 1.152 ± 0.003
0.768CysGlu: 0.768 ± 0.457
0.768CysPhe: 0.768 ± 0.231
0.768CysGly: 0.768 ± 0.457
0.384CysHis: 0.384 ± 0.228
0.768CysIle: 0.768 ± 0.457
0.0CysLys: 0.0 ± 0.0
1.152CysLeu: 1.152 ± 0.685
0.384CysMet: 0.384 ± 0.46
0.0CysAsn: 0.0 ± 0.0
0.384CysPro: 0.384 ± 0.46
1.152CysGln: 1.152 ± 0.003
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.768CysThr: 0.768 ± 0.231
0.384CysVal: 0.384 ± 0.46
0.384CysTrp: 0.384 ± 0.46
1.536CysTyr: 1.536 ± 0.225
0.0CysXaa: 0.0 ± 0.0
Asp
3.455AspAla: 3.455 ± 0.679
0.384AspCys: 0.384 ± 0.228
5.374AspAsp: 5.374 ± 0.932
4.223AspGlu: 4.223 ± 1.824
2.687AspPhe: 2.687 ± 0.466
2.687AspGly: 2.687 ± 0.911
1.152AspHis: 1.152 ± 0.685
4.223AspIle: 4.223 ± 1.136
4.99AspLys: 4.99 ± 0.472
4.99AspLeu: 4.99 ± 1.593
1.536AspMet: 1.536 ± 0.914
1.919AspAsn: 1.919 ± 1.142
2.303AspPro: 2.303 ± 0.682
2.303AspGln: 2.303 ± 0.006
1.536AspArg: 1.536 ± 0.463
4.99AspSer: 4.99 ± 1.593
3.071AspThr: 3.071 ± 0.926
3.839AspVal: 3.839 ± 1.157
1.152AspTrp: 1.152 ± 0.003
3.071AspTyr: 3.071 ± 0.451
0.0AspXaa: 0.0 ± 0.0
Glu
3.839GluAla: 3.839 ± 0.219
0.384GluCys: 0.384 ± 0.228
3.071GluAsp: 3.071 ± 0.451
1.536GluGlu: 1.536 ± 0.914
3.839GluPhe: 3.839 ± 1.157
1.919GluGly: 1.919 ± 1.142
0.768GluHis: 0.768 ± 0.457
3.071GluIle: 3.071 ± 0.451
1.152GluLys: 1.152 ± 0.685
3.071GluLeu: 3.071 ± 1.827
1.919GluMet: 1.919 ± 0.454
2.687GluAsn: 2.687 ± 0.466
1.536GluPro: 1.536 ± 0.914
2.687GluGln: 2.687 ± 0.911
3.071GluArg: 3.071 ± 1.139
3.839GluSer: 3.839 ± 0.219
5.374GluThr: 5.374 ± 0.244
4.607GluVal: 4.607 ± 0.012
0.384GluTrp: 0.384 ± 0.228
3.071GluTyr: 3.071 ± 0.451
0.0GluXaa: 0.0 ± 0.0
Phe
2.687PheAla: 2.687 ± 1.154
2.303PheCys: 2.303 ± 0.006
4.99PheAsp: 4.99 ± 0.216
3.455PheGlu: 3.455 ± 0.679
1.536PhePhe: 1.536 ± 0.225
5.374PheGly: 5.374 ± 0.445
1.919PheHis: 1.919 ± 0.454
2.303PheIle: 2.303 ± 0.006
2.687PheLys: 2.687 ± 0.911
3.455PheLeu: 3.455 ± 0.679
0.768PheMet: 0.768 ± 0.231
2.687PheAsn: 2.687 ± 1.154
1.152PhePro: 1.152 ± 0.003
1.536PheGln: 1.536 ± 0.225
1.919PheArg: 1.919 ± 0.923
4.223PheSer: 4.223 ± 0.929
2.687PheThr: 2.687 ± 0.911
3.839PheVal: 3.839 ± 0.908
0.768PheTrp: 0.768 ± 0.231
3.071PheTyr: 3.071 ± 2.302
0.0PheXaa: 0.0 ± 0.0
Gly
4.223GlyAla: 4.223 ± 2.994
0.0GlyCys: 0.0 ± 0.0
5.374GlyAsp: 5.374 ± 1.133
2.303GlyGlu: 2.303 ± 0.694
2.687GlyPhe: 2.687 ± 0.466
5.758GlyGly: 5.758 ± 2.08
2.303GlyHis: 2.303 ± 1.383
6.526GlyIle: 6.526 ± 0.247
3.455GlyLys: 3.455 ± 0.679
3.071GlyLeu: 3.071 ± 0.238
0.768GlyMet: 0.768 ± 0.457
2.687GlyAsn: 2.687 ± 1.154
1.536GlyPro: 1.536 ± 1.151
2.303GlyGln: 2.303 ± 0.006
2.687GlyArg: 2.687 ± 0.222
5.374GlySer: 5.374 ± 0.932
6.91GlyThr: 6.91 ± 2.083
4.607GlyVal: 4.607 ± 0.676
0.384GlyTrp: 0.384 ± 0.228
2.687GlyTyr: 2.687 ± 0.222
0.0GlyXaa: 0.0 ± 0.0
His
1.152HisAla: 1.152 ± 0.003
0.768HisCys: 0.768 ± 0.457
1.536HisAsp: 1.536 ± 0.914
0.0HisGlu: 0.0 ± 0.0
1.919HisPhe: 1.919 ± 0.923
1.919HisGly: 1.919 ± 1.142
0.768HisHis: 0.768 ± 0.231
1.152HisIle: 1.152 ± 0.685
1.152HisLys: 1.152 ± 0.003
2.303HisLeu: 2.303 ± 0.694
1.152HisMet: 1.152 ± 0.685
2.303HisAsn: 2.303 ± 0.006
0.768HisPro: 0.768 ± 0.231
0.768HisGln: 0.768 ± 0.231
0.384HisArg: 0.384 ± 0.46
0.384HisSer: 0.384 ± 0.228
2.687HisThr: 2.687 ± 0.466
3.071HisVal: 3.071 ± 1.139
0.768HisTrp: 0.768 ± 0.457
0.384HisTyr: 0.384 ± 0.46
0.0HisXaa: 0.0 ± 0.0
Ile
3.839IleAla: 3.839 ± 0.908
1.152IleCys: 1.152 ± 0.685
3.455IleAsp: 3.455 ± 0.009
3.839IleGlu: 3.839 ± 0.469
3.071IlePhe: 3.071 ± 1.139
4.223IleGly: 4.223 ± 0.448
1.919IleHis: 1.919 ± 0.454
2.687IleIle: 2.687 ± 0.222
1.536IleLys: 1.536 ± 0.463
4.607IleLeu: 4.607 ± 0.676
1.919IleMet: 1.919 ± 1.142
1.919IleAsn: 1.919 ± 1.142
1.536IlePro: 1.536 ± 0.463
2.687IleGln: 2.687 ± 0.222
2.687IleArg: 2.687 ± 0.222
5.374IleSer: 5.374 ± 0.244
2.303IleThr: 2.303 ± 0.006
4.607IleVal: 4.607 ± 0.676
0.768IleTrp: 0.768 ± 0.92
2.687IleTyr: 2.687 ± 0.466
0.0IleXaa: 0.0 ± 0.0
Lys
3.071LysAla: 3.071 ± 0.451
0.384LysCys: 0.384 ± 0.228
2.303LysAsp: 2.303 ± 1.37
2.687LysGlu: 2.687 ± 1.599
3.839LysPhe: 3.839 ± 0.908
3.071LysGly: 3.071 ± 1.139
1.919LysHis: 1.919 ± 0.454
3.455LysIle: 3.455 ± 0.679
3.455LysLys: 3.455 ± 1.367
5.374LysLeu: 5.374 ± 0.445
0.768LysMet: 0.768 ± 0.457
2.687LysAsn: 2.687 ± 0.466
1.152LysPro: 1.152 ± 0.003
1.919LysGln: 1.919 ± 1.142
4.223LysArg: 4.223 ± 1.136
3.071LysSer: 3.071 ± 0.451
3.839LysThr: 3.839 ± 0.469
3.839LysVal: 3.839 ± 0.469
0.768LysTrp: 0.768 ± 0.457
1.919LysTyr: 1.919 ± 0.923
0.0LysXaa: 0.0 ± 0.0
Leu
3.455LeuAla: 3.455 ± 0.009
0.768LeuCys: 0.768 ± 0.231
4.223LeuAsp: 4.223 ± 1.136
3.071LeuGlu: 3.071 ± 1.139
4.99LeuPhe: 4.99 ± 0.216
5.374LeuGly: 5.374 ± 0.445
2.303LeuHis: 2.303 ± 1.37
2.687LeuIle: 2.687 ± 1.599
4.99LeuLys: 4.99 ± 1.16
6.91LeuLeu: 6.91 ± 0.706
0.768LeuMet: 0.768 ± 0.457
5.758LeuAsn: 5.758 ± 0.703
4.99LeuPro: 4.99 ± 0.472
3.455LeuGln: 3.455 ± 0.679
5.374LeuArg: 5.374 ± 1.133
5.758LeuSer: 5.758 ± 0.703
4.99LeuThr: 4.99 ± 0.904
7.294LeuVal: 7.294 ± 0.21
1.536LeuTrp: 1.536 ± 0.914
2.303LeuTyr: 2.303 ± 0.682
0.0LeuXaa: 0.0 ± 0.0
Met
3.071MetAla: 3.071 ± 1.139
0.768MetCys: 0.768 ± 0.457
1.152MetAsp: 1.152 ± 0.685
1.919MetGlu: 1.919 ± 0.234
1.919MetPhe: 1.919 ± 0.234
1.919MetGly: 1.919 ± 0.923
0.0MetHis: 0.0 ± 0.0
1.152MetIle: 1.152 ± 0.685
0.768MetLys: 0.768 ± 0.457
0.384MetLeu: 0.384 ± 0.228
0.384MetMet: 0.384 ± 0.228
0.384MetAsn: 0.384 ± 0.228
1.152MetPro: 1.152 ± 0.691
0.0MetGln: 0.0 ± 0.0
0.768MetArg: 0.768 ± 0.457
1.919MetSer: 1.919 ± 0.234
1.152MetThr: 1.152 ± 0.685
0.0MetVal: 0.0 ± 0.0
0.384MetTrp: 0.384 ± 0.228
1.536MetTyr: 1.536 ± 0.914
0.0MetXaa: 0.0 ± 0.0
Asn
4.607AsnAla: 4.607 ± 0.7
0.384AsnCys: 0.384 ± 0.228
1.536AsnAsp: 1.536 ± 0.225
1.919AsnGlu: 1.919 ± 0.234
0.384AsnPhe: 0.384 ± 0.228
3.839AsnGly: 3.839 ± 1.845
1.152AsnHis: 1.152 ± 0.003
3.071AsnIle: 3.071 ± 0.238
1.919AsnLys: 1.919 ± 1.142
4.223AsnLeu: 4.223 ± 1.136
0.768AsnMet: 0.768 ± 0.231
0.768AsnAsn: 0.768 ± 0.457
2.687AsnPro: 2.687 ± 1.154
0.384AsnGln: 0.384 ± 0.46
1.536AsnArg: 1.536 ± 0.225
3.839AsnSer: 3.839 ± 0.469
3.455AsnThr: 3.455 ± 1.386
6.526AsnVal: 6.526 ± 0.935
0.384AsnTrp: 0.384 ± 0.46
1.152AsnTyr: 1.152 ± 0.003
0.0AsnXaa: 0.0 ± 0.0
Pro
2.303ProAla: 2.303 ± 1.383
0.768ProCys: 0.768 ± 0.457
1.919ProAsp: 1.919 ± 0.234
1.536ProGlu: 1.536 ± 0.463
2.687ProPhe: 2.687 ± 1.154
3.455ProGly: 3.455 ± 2.762
0.384ProHis: 0.384 ± 0.46
1.152ProIle: 1.152 ± 0.003
2.303ProLys: 2.303 ± 0.006
4.223ProLeu: 4.223 ± 0.241
1.152ProMet: 1.152 ± 0.003
0.384ProAsn: 0.384 ± 0.46
1.919ProPro: 1.919 ± 0.923
1.152ProGln: 1.152 ± 0.685
2.687ProArg: 2.687 ± 0.466
3.455ProSer: 3.455 ± 0.679
3.839ProThr: 3.839 ± 2.534
1.919ProVal: 1.919 ± 0.923
0.384ProTrp: 0.384 ± 0.228
3.839ProTyr: 3.839 ± 1.157
0.0ProXaa: 0.0 ± 0.0
Gln
1.919GlnAla: 1.919 ± 0.234
0.384GlnCys: 0.384 ± 0.46
1.919GlnAsp: 1.919 ± 0.234
1.536GlnGlu: 1.536 ± 0.225
0.768GlnPhe: 0.768 ± 0.231
1.919GlnGly: 1.919 ± 0.454
1.152GlnHis: 1.152 ± 0.691
1.536GlnIle: 1.536 ± 0.225
2.303GlnLys: 2.303 ± 1.37
3.455GlnLeu: 3.455 ± 1.367
1.152GlnMet: 1.152 ± 0.691
1.536GlnAsn: 1.536 ± 0.225
2.303GlnPro: 2.303 ± 0.006
3.071GlnGln: 3.071 ± 1.139
3.071GlnArg: 3.071 ± 1.827
3.455GlnSer: 3.455 ± 0.009
2.303GlnThr: 2.303 ± 0.006
2.687GlnVal: 2.687 ± 0.466
0.0GlnTrp: 0.0 ± 0.0
0.384GlnTyr: 0.384 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
3.071ArgAla: 3.071 ± 0.238
0.384ArgCys: 0.384 ± 0.228
3.455ArgAsp: 3.455 ± 0.679
3.455ArgGlu: 3.455 ± 2.056
4.607ArgPhe: 4.607 ± 0.012
3.071ArgGly: 3.071 ± 0.451
1.152ArgHis: 1.152 ± 0.003
3.071ArgIle: 3.071 ± 0.451
2.303ArgLys: 2.303 ± 0.682
3.455ArgLeu: 3.455 ± 0.679
1.919ArgMet: 1.919 ± 1.142
2.303ArgAsn: 2.303 ± 0.006
2.687ArgPro: 2.687 ± 1.154
1.152ArgGln: 1.152 ± 0.003
1.919ArgArg: 1.919 ± 1.142
2.303ArgSer: 2.303 ± 1.37
3.071ArgThr: 3.071 ± 1.139
2.687ArgVal: 2.687 ± 0.222
0.384ArgTrp: 0.384 ± 0.46
2.303ArgTyr: 2.303 ± 1.383
0.0ArgXaa: 0.0 ± 0.0
Ser
6.91SerAla: 6.91 ± 1.395
0.768SerCys: 0.768 ± 0.231
5.374SerAsp: 5.374 ± 2.509
3.071SerGlu: 3.071 ± 0.926
6.91SerPhe: 6.91 ± 0.67
3.071SerGly: 3.071 ± 0.926
1.919SerHis: 1.919 ± 0.234
4.99SerIle: 4.99 ± 0.472
3.455SerLys: 3.455 ± 1.367
8.061SerLeu: 8.061 ± 0.021
0.768SerMet: 0.768 ± 0.457
3.071SerAsn: 3.071 ± 0.451
3.839SerPro: 3.839 ± 0.469
1.919SerGln: 1.919 ± 1.142
3.455SerArg: 3.455 ± 0.679
4.607SerSer: 4.607 ± 0.7
3.071SerThr: 3.071 ± 0.926
4.99SerVal: 4.99 ± 1.16
0.768SerTrp: 0.768 ± 0.92
2.303SerTyr: 2.303 ± 1.383
0.0SerXaa: 0.0 ± 0.0
Thr
6.142ThrAla: 6.142 ± 1.163
0.768ThrCys: 0.768 ± 0.92
3.839ThrAsp: 3.839 ± 0.469
4.99ThrGlu: 4.99 ± 0.216
3.071ThrPhe: 3.071 ± 0.451
4.223ThrGly: 4.223 ± 3.682
1.536ThrHis: 1.536 ± 0.225
6.142ThrIle: 6.142 ± 2.54
5.374ThrLys: 5.374 ± 2.509
7.294ThrLeu: 7.294 ± 2.543
0.0ThrMet: 0.0 ± 0.256
1.919ThrAsn: 1.919 ± 0.923
3.455ThrPro: 3.455 ± 1.386
2.687ThrGln: 2.687 ± 0.466
3.455ThrArg: 3.455 ± 0.697
4.607ThrSer: 4.607 ± 0.676
5.374ThrThr: 5.374 ± 2.308
1.919ThrVal: 1.919 ± 0.454
0.768ThrTrp: 0.768 ± 0.457
1.919ThrTyr: 1.919 ± 0.234
0.0ThrXaa: 0.0 ± 0.0
Val
5.374ValAla: 5.374 ± 0.445
0.768ValCys: 0.768 ± 0.231
3.455ValAsp: 3.455 ± 0.679
3.839ValGlu: 3.839 ± 0.908
1.919ValPhe: 1.919 ± 0.454
4.607ValGly: 4.607 ± 0.7
2.303ValHis: 2.303 ± 1.37
3.071ValIle: 3.071 ± 1.139
5.374ValLys: 5.374 ± 1.133
4.99ValLeu: 4.99 ± 0.216
1.152ValMet: 1.152 ± 0.003
4.223ValAsn: 4.223 ± 1.617
3.071ValPro: 3.071 ± 2.99
3.071ValGln: 3.071 ± 0.238
3.839ValArg: 3.839 ± 2.284
5.758ValSer: 5.758 ± 2.768
4.99ValThr: 4.99 ± 1.16
4.99ValVal: 4.99 ± 1.593
0.0ValTrp: 0.0 ± 0.0
3.455ValTyr: 3.455 ± 1.386
0.0ValXaa: 0.0 ± 0.0
Trp
0.768TrpAla: 0.768 ± 0.457
0.384TrpCys: 0.384 ± 0.228
0.0TrpAsp: 0.0 ± 0.0
0.384TrpGlu: 0.384 ± 0.228
0.384TrpPhe: 0.384 ± 0.228
0.384TrpGly: 0.384 ± 0.46
0.768TrpHis: 0.768 ± 0.231
0.384TrpIle: 0.384 ± 0.228
1.152TrpLys: 1.152 ± 0.691
1.152TrpLeu: 1.152 ± 0.003
0.384TrpMet: 0.384 ± 0.228
1.152TrpAsn: 1.152 ± 0.003
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.768TrpArg: 0.768 ± 0.231
0.384TrpSer: 0.384 ± 0.228
0.768TrpThr: 0.768 ± 0.231
0.768TrpVal: 0.768 ± 0.231
0.384TrpTrp: 0.384 ± 0.228
1.152TrpTyr: 1.152 ± 0.003
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.839TyrAla: 3.839 ± 3.222
1.536TyrCys: 1.536 ± 0.225
2.303TyrAsp: 2.303 ± 0.682
2.687TyrGlu: 2.687 ± 1.599
3.839TyrPhe: 3.839 ± 0.219
3.455TyrGly: 3.455 ± 2.074
1.152TyrHis: 1.152 ± 0.691
2.687TyrIle: 2.687 ± 0.222
2.303TyrLys: 2.303 ± 0.006
1.919TyrLeu: 1.919 ± 0.234
1.919TyrMet: 1.919 ± 1.048
1.152TyrAsn: 1.152 ± 0.003
1.152TyrPro: 1.152 ± 0.003
1.536TyrGln: 1.536 ± 1.151
2.687TyrArg: 2.687 ± 1.154
3.455TyrSer: 3.455 ± 0.009
4.99TyrThr: 4.99 ± 0.216
1.919TyrVal: 1.919 ± 0.454
0.384TyrTrp: 0.384 ± 0.228
4.99TyrTyr: 4.99 ± 1.593
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2606 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski