Amino acid dipeptide frequency for Changjiang picorna-like virus 13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
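
As a minimal sketch of the calculation described above, the following Python snippet counts adjacent residue pairs and reports them in per mille. The function names, the one-letter alphabet, and the use of a uniform 2.5‰ baseline for the over/under label are assumptions made for illustration; the database may compute its expected values differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands in for Xaa (non-standard or unknown)

# Uniform baseline: 1 of 400 standard dipeptides, expressed in per mille.
UNIFORM_EXPECTED = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(sequences):
    """Return per-mille frequencies for all 441 dipeptides over the given sequences."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X.
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Pairs are counted within each sequence only, never across sequences.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def label(value, expected=UNIFORM_EXPECTED):
    """Classify a per-mille value against the (assumed) uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"
```

For instance, `dipeptide_per_mille(["AAAC"])` sees the pairs AA, AA and AC, so AA accounts for two thirds of all pairs.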

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions (for example, 2.5‰ = 0.0025 = 0.25%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.048 ± 0.244
AlaCys: 0.819 ± 0.427
AlaAsp: 2.868 ± 0.473
AlaGlu: 4.097 ± 0.489
AlaPhe: 3.687 ± 0.61
AlaGly: 4.506 ± 2.243
AlaHis: 0.0 ± 0.0
AlaIle: 4.097 ± 0.489
AlaLys: 3.687 ± 0.61
AlaLeu: 3.687 ± 0.046
AlaMet: 0.819 ± 0.427
AlaAsn: 2.868 ± 0.183
AlaPro: 4.097 ± 0.489
AlaGln: 2.458 ± 0.031
AlaArg: 2.458 ± 0.625
AlaSer: 5.735 ± 2.915
AlaThr: 4.916 ± 3.342
AlaVal: 4.506 ± 1.037
AlaTrp: 0.41 ± 0.214
AlaTyr: 4.097 ± 0.489
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.639 ± 0.458
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.819 ± 0.427
CysPhe: 0.41 ± 0.442
CysGly: 1.229 ± 0.641
CysHis: 0.41 ± 0.214
CysIle: 0.41 ± 0.214
CysLys: 0.41 ± 0.214
CysLeu: 1.229 ± 0.015
CysMet: 0.41 ± 0.214
CysAsn: 0.0 ± 0.0
CysPro: 0.819 ± 0.427
CysGln: 0.819 ± 0.427
CysArg: 1.229 ± 0.015
CysSer: 2.048 ± 0.9
CysThr: 0.819 ± 0.427
CysVal: 1.229 ± 0.641
CysTrp: 0.0 ± 0.0
CysTyr: 0.819 ± 0.427
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.277 ± 0.26
AspCys: 1.229 ± 0.671
AspAsp: 3.277 ± 1.052
AspGlu: 3.277 ± 0.26
AspPhe: 4.506 ± 1.037
AspGly: 1.639 ± 0.854
AspHis: 0.41 ± 0.214
AspIle: 3.277 ± 1.708
AspLys: 3.687 ± 1.266
AspLeu: 2.868 ± 0.183
AspMet: 0.819 ± 0.427
AspAsn: 1.639 ± 0.198
AspPro: 2.048 ± 0.412
AspGln: 0.819 ± 0.229
AspArg: 3.687 ± 1.266
AspSer: 3.277 ± 0.916
AspThr: 3.277 ± 0.26
AspVal: 5.326 ± 1.16
AspTrp: 1.639 ± 0.458
AspTyr: 4.916 ± 1.25
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.458 ± 1.281
GluCys: 0.41 ± 0.214
GluAsp: 6.145 ± 1.235
GluGlu: 2.868 ± 1.495
GluPhe: 4.916 ± 1.25
GluGly: 2.048 ± 0.244
GluHis: 1.639 ± 0.458
GluIle: 4.916 ± 1.25
GluLys: 5.326 ± 1.464
GluLeu: 4.097 ± 1.145
GluMet: 2.458 ± 0.625
GluAsn: 3.277 ± 1.052
GluPro: 1.229 ± 0.641
GluGln: 1.639 ± 0.854
GluArg: 3.277 ± 0.396
GluSer: 2.458 ± 1.281
GluThr: 4.916 ± 0.062
GluVal: 5.735 ± 0.291
GluTrp: 0.41 ± 0.214
GluTyr: 1.639 ± 0.458
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.735 ± 1.021
PheCys: 1.229 ± 0.641
PheAsp: 3.277 ± 1.052
PheGlu: 2.868 ± 0.473
PhePhe: 1.229 ± 0.015
PheGly: 4.097 ± 0.823
PheHis: 2.048 ± 0.244
PheIle: 4.097 ± 0.489
PheLys: 2.868 ± 0.183
PheLeu: 8.603 ± 1.204
PheMet: 1.639 ± 0.198
PheAsn: 2.868 ± 0.473
PhePro: 0.819 ± 0.229
PheGln: 0.819 ± 0.229
PheArg: 4.097 ± 0.167
PheSer: 6.145 ± 1.389
PheThr: 2.458 ± 0.687
PheVal: 4.097 ± 0.167
PheTrp: 1.229 ± 0.015
PheTyr: 1.229 ± 0.641
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.229 ± 0.015
GlyCys: 0.41 ± 0.214
GlyAsp: 2.868 ± 0.839
GlyGlu: 2.048 ± 0.9
GlyPhe: 3.277 ± 0.916
GlyGly: 3.277 ± 2.884
GlyHis: 0.41 ± 0.214
GlyIle: 4.097 ± 0.823
GlyLys: 2.458 ± 0.687
GlyLeu: 4.916 ± 0.718
GlyMet: 1.639 ± 0.446
GlyAsn: 3.687 ± 0.046
GlyPro: 1.639 ± 0.198
GlyGln: 4.097 ± 0.167
GlyArg: 0.819 ± 0.229
GlySer: 4.097 ± 0.167
GlyThr: 4.097 ± 1.801
GlyVal: 3.687 ± 0.702
GlyTrp: 0.819 ± 0.229
GlyTyr: 2.868 ± 1.129
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.048 ± 1.556
HisCys: 0.0 ± 0.0
HisAsp: 1.229 ± 0.015
HisGlu: 0.0 ± 0.0
HisPhe: 2.048 ± 0.9
HisGly: 1.229 ± 0.015
HisHis: 0.0 ± 0.0
HisIle: 0.819 ± 0.427
HisLys: 0.819 ± 0.427
HisLeu: 2.868 ± 0.839
HisMet: 0.41 ± 0.214
HisAsn: 1.639 ± 0.198
HisPro: 0.819 ± 0.427
HisGln: 0.819 ± 0.427
HisArg: 0.0 ± 0.0
HisSer: 1.639 ± 0.198
HisThr: 0.819 ± 0.229
HisVal: 0.819 ± 0.427
HisTrp: 0.0 ± 0.0
HisTyr: 0.41 ± 0.214
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.097 ± 0.167
IleCys: 0.819 ± 0.427
IleAsp: 3.277 ± 0.26
IleGlu: 6.964 ± 2.318
IlePhe: 3.277 ± 1.052
IleGly: 2.868 ± 1.785
IleHis: 1.229 ± 0.015
IleIle: 2.868 ± 0.839
IleLys: 2.868 ± 0.839
IleLeu: 6.145 ± 0.579
IleMet: 0.819 ± 0.229
IleAsn: 2.048 ± 0.9
IlePro: 3.687 ± 2.014
IleGln: 0.819 ± 0.427
IleArg: 2.868 ± 0.839
IleSer: 4.097 ± 1.145
IleThr: 4.097 ± 0.489
IleVal: 3.687 ± 0.61
IleTrp: 0.819 ± 0.427
IleTyr: 2.048 ± 0.412
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.868 ± 0.839
LysCys: 0.41 ± 0.214
LysAsp: 5.326 ± 2.12
LysGlu: 4.506 ± 2.349
LysPhe: 5.326 ± 0.808
LysGly: 0.41 ± 0.214
LysHis: 2.458 ± 1.281
LysIle: 3.277 ± 1.052
LysLys: 3.277 ± 0.26
LysLeu: 6.964 ± 1.662
LysMet: 1.639 ± 0.198
LysAsn: 2.868 ± 1.495
LysPro: 2.458 ± 0.687
LysGln: 2.868 ± 0.183
LysArg: 3.277 ± 1.052
LysSer: 3.277 ± 1.052
LysThr: 3.277 ± 0.916
LysVal: 6.555 ± 1.448
LysTrp: 0.0 ± 0.0
LysTyr: 0.819 ± 0.427
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.555 ± 0.52
LeuCys: 1.229 ± 0.015
LeuAsp: 6.145 ± 0.733
LeuGlu: 6.145 ± 1.235
LeuPhe: 3.277 ± 1.052
LeuGly: 4.097 ± 0.823
LeuHis: 1.229 ± 0.671
LeuIle: 3.687 ± 0.046
LeuLys: 8.193 ± 1.646
LeuLeu: 6.145 ± 0.579
LeuMet: 3.687 ± 0.046
LeuAsn: 4.916 ± 0.718
LeuPro: 4.097 ± 0.489
LeuGln: 2.868 ± 0.183
LeuArg: 3.687 ± 0.702
LeuSer: 5.735 ± 0.947
LeuThr: 3.687 ± 1.358
LeuVal: 4.097 ± 0.167
LeuTrp: 1.229 ± 0.641
LeuTyr: 2.868 ± 1.495
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.229 ± 0.015
MetCys: 0.41 ± 0.214
MetAsp: 2.048 ± 0.412
MetGlu: 0.819 ± 0.229
MetPhe: 0.819 ± 0.229
MetGly: 0.41 ± 0.214
MetHis: 1.229 ± 0.671
MetIle: 1.229 ± 0.015
MetLys: 2.048 ± 0.244
MetLeu: 2.048 ± 1.068
MetMet: 0.41 ± 0.214
MetAsn: 2.048 ± 0.244
MetPro: 0.819 ± 0.427
MetGln: 1.229 ± 0.641
MetArg: 1.639 ± 0.198
MetSer: 0.819 ± 0.427
MetThr: 2.458 ± 0.625
MetVal: 1.639 ± 0.198
MetTrp: 0.0 ± 0.0
MetTyr: 1.229 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.048 ± 2.212
AsnCys: 0.819 ± 0.229
AsnAsp: 2.048 ± 1.068
AsnGlu: 5.735 ± 1.677
AsnPhe: 1.229 ± 0.015
AsnGly: 4.097 ± 0.823
AsnHis: 0.819 ± 0.427
AsnIle: 4.097 ± 0.167
AsnLys: 1.639 ± 0.198
AsnLeu: 5.735 ± 0.291
AsnMet: 0.819 ± 0.427
AsnAsn: 2.458 ± 1.281
AsnPro: 3.277 ± 0.396
AsnGln: 0.819 ± 0.427
AsnArg: 2.868 ± 0.473
AsnSer: 5.735 ± 0.291
AsnThr: 4.097 ± 3.113
AsnVal: 2.868 ± 1.785
AsnTrp: 0.41 ± 0.442
AsnTyr: 1.639 ± 0.458
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.868 ± 0.473
ProCys: 0.0 ± 0.0
ProAsp: 0.41 ± 0.214
ProGlu: 2.048 ± 1.068
ProPhe: 5.326 ± 1.16
ProGly: 1.639 ± 0.198
ProHis: 0.0 ± 0.0
ProIle: 2.048 ± 0.9
ProLys: 1.229 ± 0.641
ProLeu: 3.687 ± 1.358
ProMet: 0.819 ± 0.427
ProAsn: 3.277 ± 0.916
ProPro: 1.639 ± 1.114
ProGln: 1.229 ± 0.671
ProArg: 3.687 ± 0.61
ProSer: 5.735 ± 1.603
ProThr: 1.639 ± 0.198
ProVal: 3.687 ± 1.358
ProTrp: 1.229 ± 0.015
ProTyr: 2.048 ± 0.244
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.41 ± 0.214
GlnCys: 0.41 ± 0.214
GlnAsp: 0.819 ± 0.885
GlnGlu: 2.458 ± 0.031
GlnPhe: 2.868 ± 0.183
GlnGly: 1.639 ± 0.854
GlnHis: 0.819 ± 0.229
GlnIle: 2.048 ± 0.244
GlnLys: 3.277 ± 1.708
GlnLeu: 2.868 ± 0.183
GlnMet: 1.229 ± 0.015
GlnAsn: 4.097 ± 0.489
GlnPro: 2.868 ± 0.183
GlnGln: 1.229 ± 0.671
GlnArg: 2.458 ± 0.625
GlnSer: 2.048 ± 0.412
GlnThr: 2.048 ± 0.244
GlnVal: 0.819 ± 0.427
GlnTrp: 0.41 ± 0.214
GlnTyr: 1.639 ± 0.198
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.868 ± 0.839
ArgCys: 0.819 ± 0.427
ArgAsp: 1.639 ± 0.458
ArgGlu: 2.048 ± 0.9
ArgPhe: 1.229 ± 0.015
ArgGly: 3.277 ± 0.26
ArgHis: 0.819 ± 0.427
ArgIle: 3.687 ± 0.61
ArgLys: 4.097 ± 1.479
ArgLeu: 4.097 ± 0.489
ArgMet: 1.229 ± 0.015
ArgAsn: 3.687 ± 0.61
ArgPro: 1.639 ± 0.458
ArgGln: 2.868 ± 1.495
ArgArg: 3.277 ± 1.052
ArgSer: 2.868 ± 0.839
ArgThr: 2.868 ± 0.183
ArgVal: 3.277 ± 0.396
ArgTrp: 0.41 ± 0.214
ArgTyr: 2.868 ± 1.129
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.735 ± 1.603
SerCys: 1.639 ± 0.198
SerAsp: 3.277 ± 0.916
SerGlu: 4.916 ± 1.25
SerPhe: 5.735 ± 0.291
SerGly: 3.687 ± 1.358
SerHis: 2.048 ± 0.412
SerIle: 7.784 ± 3.159
SerLys: 6.964 ± 2.318
SerLeu: 1.639 ± 0.854
SerMet: 1.639 ± 1.114
SerAsn: 3.687 ± 2.67
SerPro: 2.868 ± 0.473
SerGln: 2.458 ± 0.031
SerArg: 4.506 ± 1.037
SerSer: 4.916 ± 0.062
SerThr: 5.326 ± 1.16
SerVal: 5.326 ± 0.152
SerTrp: 0.819 ± 0.229
SerTyr: 2.458 ± 0.687
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.555 ± 1.176
ThrCys: 1.639 ± 0.458
ThrAsp: 2.458 ± 1.999
ThrGlu: 3.687 ± 1.266
ThrPhe: 2.458 ± 0.031
ThrGly: 6.145 ± 5.325
ThrHis: 1.229 ± 0.671
ThrIle: 3.687 ± 0.702
ThrLys: 3.687 ± 1.922
ThrLeu: 3.277 ± 0.916
ThrMet: 0.41 ± 0.214
ThrAsn: 2.458 ± 0.031
ThrPro: 3.277 ± 2.228
ThrGln: 2.458 ± 1.343
ThrArg: 2.868 ± 0.183
ThrSer: 4.916 ± 2.686
ThrThr: 5.326 ± 1.816
ThrVal: 4.097 ± 1.145
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.048 ± 0.244
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.735 ± 2.259
ValCys: 1.639 ± 0.854
ValAsp: 4.097 ± 1.479
ValGlu: 4.097 ± 0.167
ValPhe: 6.555 ± 0.52
ValGly: 3.687 ± 1.922
ValHis: 0.41 ± 0.214
ValIle: 1.639 ± 0.854
ValLys: 4.097 ± 0.167
ValLeu: 9.013 ± 0.551
ValMet: 1.229 ± 0.759
ValAsn: 2.458 ± 0.031
ValPro: 5.326 ± 1.16
ValGln: 3.687 ± 0.046
ValArg: 1.639 ± 0.458
ValSer: 6.145 ± 0.579
ValThr: 2.868 ± 1.785
ValVal: 4.506 ± 1.587
ValTrp: 0.819 ± 0.427
ValTyr: 1.229 ± 0.015
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.41 ± 0.442
TrpCys: 0.41 ± 0.442
TrpAsp: 1.639 ± 0.854
TrpGlu: 0.819 ± 0.427
TrpPhe: 0.819 ± 0.427
TrpGly: 0.819 ± 0.229
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.41 ± 0.214
TrpMet: 1.229 ± 0.015
TrpAsn: 1.229 ± 0.015
TrpPro: 0.41 ± 0.442
TrpGln: 0.819 ± 0.427
TrpArg: 0.0 ± 0.0
TrpSer: 1.229 ± 0.015
TrpThr: 0.41 ± 0.442
TrpVal: 0.41 ± 0.214
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.048 ± 0.244
TyrCys: 0.41 ± 0.214
TyrAsp: 2.048 ± 1.068
TyrGlu: 1.639 ± 0.198
TyrPhe: 2.458 ± 0.031
TyrGly: 1.639 ± 0.458
TyrHis: 1.229 ± 0.641
TyrIle: 1.639 ± 0.198
TyrLys: 1.229 ± 0.641
TyrLeu: 3.277 ± 0.26
TyrMet: 0.819 ± 0.229
TyrAsn: 2.048 ± 0.9
TyrPro: 0.41 ± 0.214
TyrGln: 1.639 ± 0.198
TyrArg: 1.229 ± 1.327
TyrSer: 4.506 ± 0.381
TyrThr: 3.277 ± 0.916
TyrVal: 4.916 ± 1.25
TyrTrp: 0.41 ± 0.442
TyrTyr: 2.458 ± 1.999
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2442 amino acids)
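
The entries listed above can be read back into numbers with a short parser. The sketch below is illustrative only: `parse_row`, `approx_count`, and `ASSUMED_TOTAL_PAIRS` are hypothetical names, and the denominator of 2441 pairs is an assumption inferred from the printed values (which look like integer multiples of roughly 1/2441, one less than the 2442 residues reported), not something the page states.

```python
import re

# Matches e.g. "AlaAla: 2.048 ± 0.244"; any text before the dipeptide name is ignored.
ROW = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")

# Assumption: total number of adjacent residue pairs behind the per-mille values.
ASSUMED_TOTAL_PAIRS = 2441


def parse_row(line):
    """Return (first, second, per_mille, error) for a table entry, or None otherwise."""
    match = ROW.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def approx_count(per_mille, total_pairs=ASSUMED_TOTAL_PAIRS):
    """Convert a per-mille frequency to an approximate raw count of occurrences."""
    return round(per_mille * 1e-3 * total_pairs)
```

Under that assumption, the ValLeu value of 9.013‰ corresponds to about 22 occurrences and the smallest non-zero value, 0.41‰, to a single occurrence.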

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
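
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. This is a minimal illustration under stated assumptions, not the database's actual code; the function names, the fixed seed, and the choice of the population standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes) over all adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    return statistics.pstdev(estimates)
```

With only two proteins in the proteome, each replicate can contain just one of three combinations (the first protein twice, the second twice, or one of each), which may explain why many dipeptides in the listing share identical error values.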

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski