Amino acid dipepetide frequency for Wenzhou picorna-like virus 26

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.891AlaAla: 5.891 ± 0.472
0.736AlaCys: 0.736 ± 0.392
3.314AlaAsp: 3.314 ± 0.562
3.314AlaGlu: 3.314 ± 0.04
4.786AlaPhe: 4.786 ± 1.06
4.786AlaGly: 4.786 ± 1.347
0.736AlaHis: 0.736 ± 0.392
4.786AlaIle: 4.786 ± 2.264
4.05AlaLys: 4.05 ± 0.955
4.05AlaLeu: 4.05 ± 2.159
1.841AlaMet: 1.841 ± 0.223
1.841AlaAsn: 1.841 ± 0.824
4.786AlaPro: 4.786 ± 0.143
5.891AlaGln: 5.891 ± 0.472
4.418AlaArg: 4.418 ± 1.151
4.418AlaSer: 4.418 ± 0.655
5.523AlaThr: 5.523 ± 0.668
6.627AlaVal: 6.627 ± 0.523
1.841AlaTrp: 1.841 ± 0.379
3.314AlaTyr: 3.314 ± 3.049
0.0AlaXaa: 0.0 ± 0.0
Cys
1.473CysAla: 1.473 ± 0.785
0.368CysCys: 0.368 ± 0.196
1.105CysAsp: 1.105 ± 0.589
0.368CysGlu: 0.368 ± 0.196
0.736CysPhe: 0.736 ± 0.209
1.105CysGly: 1.105 ± 0.589
0.0CysHis: 0.0 ± 0.0
0.368CysIle: 0.368 ± 0.406
0.368CysLys: 0.368 ± 0.196
1.105CysLeu: 1.105 ± 0.013
0.0CysMet: 0.0 ± 0.0
0.736CysAsn: 0.736 ± 0.392
1.473CysPro: 1.473 ± 0.419
0.0CysGln: 0.0 ± 0.0
1.105CysArg: 1.105 ± 0.589
0.736CysSer: 0.736 ± 0.209
1.841CysThr: 1.841 ± 0.379
0.736CysVal: 0.736 ± 0.392
0.0CysTrp: 0.0 ± 0.0
0.368CysTyr: 0.368 ± 0.196
0.0CysXaa: 0.0 ± 0.0
Asp
4.786AspAla: 4.786 ± 0.458
0.736AspCys: 0.736 ± 0.392
4.786AspAsp: 4.786 ± 0.458
4.418AspGlu: 4.418 ± 0.549
3.314AspPhe: 3.314 ± 0.562
5.523AspGly: 5.523 ± 0.536
3.314AspHis: 3.314 ± 1.164
2.946AspIle: 2.946 ± 0.838
1.473AspLys: 1.473 ± 1.021
3.682AspLeu: 3.682 ± 0.445
0.368AspMet: 0.368 ± 0.196
1.841AspAsn: 1.841 ± 0.223
4.418AspPro: 4.418 ± 0.053
0.736AspGln: 0.736 ± 0.392
1.841AspArg: 1.841 ± 0.379
2.946AspSer: 2.946 ± 0.236
3.314AspThr: 3.314 ± 0.04
5.155AspVal: 5.155 ± 0.34
0.368AspTrp: 0.368 ± 0.196
2.209AspTyr: 2.209 ± 1.23
0.0AspXaa: 0.0 ± 0.0
Glu
3.314GluAla: 3.314 ± 0.562
0.0GluCys: 0.0 ± 0.0
1.841GluAsp: 1.841 ± 1.426
4.786GluGlu: 4.786 ± 0.143
2.577GluPhe: 2.577 ± 0.432
3.682GluGly: 3.682 ± 0.759
2.946GluHis: 2.946 ± 0.236
7.364GluIle: 7.364 ± 2.721
4.786GluLys: 4.786 ± 0.745
6.996GluLeu: 6.996 ± 0.117
2.209GluMet: 2.209 ± 0.026
2.209GluAsn: 2.209 ± 0.026
2.577GluPro: 2.577 ± 0.772
4.05GluGln: 4.05 ± 0.353
4.05GluArg: 4.05 ± 2.159
2.946GluSer: 2.946 ± 1.57
2.577GluThr: 2.577 ± 0.17
3.682GluVal: 3.682 ± 2.251
1.473GluTrp: 1.473 ± 0.419
2.209GluTyr: 2.209 ± 0.576
0.0GluXaa: 0.0 ± 0.0
Phe
5.155PheAla: 5.155 ± 0.864
0.368PheCys: 0.368 ± 0.196
1.841PheAsp: 1.841 ± 0.379
4.05PheGlu: 4.05 ± 1.453
1.105PhePhe: 1.105 ± 1.217
2.946PheGly: 2.946 ± 0.838
0.736PheHis: 0.736 ± 0.209
0.368PheIle: 0.368 ± 0.196
1.841PheLys: 1.841 ± 0.981
1.473PheLeu: 1.473 ± 0.785
1.105PheMet: 1.105 ± 0.013
4.418PheAsn: 4.418 ± 0.655
2.946PhePro: 2.946 ± 2.041
1.473PheGln: 1.473 ± 0.419
2.577PheArg: 2.577 ± 0.17
2.577PheSer: 2.577 ± 0.17
2.946PheThr: 2.946 ± 0.838
1.841PheVal: 1.841 ± 0.223
0.368PheTrp: 0.368 ± 0.406
0.368PheTyr: 0.368 ± 0.196
0.0PheXaa: 0.0 ± 0.0
Gly
3.682GlyAla: 3.682 ± 2.251
1.473GlyCys: 1.473 ± 0.785
6.627GlyAsp: 6.627 ± 2.93
6.259GlyGlu: 6.259 ± 1.479
2.577GlyPhe: 2.577 ± 0.432
3.682GlyGly: 3.682 ± 0.445
1.473GlyHis: 1.473 ± 0.785
4.05GlyIle: 4.05 ± 1.453
4.05GlyLys: 4.05 ± 0.955
6.996GlyLeu: 6.996 ± 1.321
3.314GlyMet: 3.314 ± 0.641
1.105GlyAsn: 1.105 ± 0.013
3.682GlyPro: 3.682 ± 0.445
1.473GlyGln: 1.473 ± 0.183
2.946GlyArg: 2.946 ± 0.968
5.155GlySer: 5.155 ± 1.466
5.523GlyThr: 5.523 ± 1.27
4.418GlyVal: 4.418 ± 1.151
1.105GlyTrp: 1.105 ± 0.013
1.841GlyTyr: 1.841 ± 0.824
0.0GlyXaa: 0.0 ± 0.0
His
1.841HisAla: 1.841 ± 0.223
0.736HisCys: 0.736 ± 0.392
1.473HisAsp: 1.473 ± 0.183
0.736HisGlu: 0.736 ± 0.392
1.473HisPhe: 1.473 ± 0.183
1.473HisGly: 1.473 ± 0.785
0.368HisHis: 0.368 ± 0.196
0.736HisIle: 0.736 ± 0.209
0.368HisLys: 0.368 ± 0.196
1.841HisLeu: 1.841 ± 0.981
0.736HisMet: 0.736 ± 0.392
0.736HisAsn: 0.736 ± 0.209
1.105HisPro: 1.105 ± 0.013
0.368HisGln: 0.368 ± 0.196
1.473HisArg: 1.473 ± 0.785
1.105HisSer: 1.105 ± 0.013
3.314HisThr: 3.314 ± 0.562
2.209HisVal: 2.209 ± 1.177
0.368HisTrp: 0.368 ± 0.196
1.105HisTyr: 1.105 ± 0.615
0.0HisXaa: 0.0 ± 0.0
Ile
4.786IleAla: 4.786 ± 1.347
1.473IleCys: 1.473 ± 0.785
4.418IleAsp: 4.418 ± 0.053
3.682IleGlu: 3.682 ± 0.157
2.946IlePhe: 2.946 ± 0.838
3.682IleGly: 3.682 ± 0.445
0.736IleHis: 0.736 ± 0.209
2.577IleIle: 2.577 ± 0.772
2.946IleLys: 2.946 ± 0.236
3.682IleLeu: 3.682 ± 0.157
0.736IleMet: 0.736 ± 0.433
1.841IleAsn: 1.841 ± 0.223
3.314IlePro: 3.314 ± 0.04
1.841IleGln: 1.841 ± 0.379
4.05IleArg: 4.05 ± 1.453
4.05IleSer: 4.05 ± 0.851
4.05IleThr: 4.05 ± 0.353
4.05IleVal: 4.05 ± 0.353
0.736IleTrp: 0.736 ± 0.392
1.473IleTyr: 1.473 ± 0.183
0.0IleXaa: 0.0 ± 0.0
Lys
4.786LysAla: 4.786 ± 1.949
0.368LysCys: 0.368 ± 0.196
2.209LysAsp: 2.209 ± 1.177
5.155LysGlu: 5.155 ± 1.543
1.841LysPhe: 1.841 ± 0.223
2.946LysGly: 2.946 ± 0.366
0.0LysHis: 0.0 ± 0.0
4.418LysIle: 4.418 ± 1.151
4.786LysLys: 4.786 ± 1.949
3.314LysLeu: 3.314 ± 0.04
1.473LysMet: 1.473 ± 0.183
1.841LysAsn: 1.841 ± 0.379
2.946LysPro: 2.946 ± 0.838
1.473LysGln: 1.473 ± 0.785
1.841LysArg: 1.841 ± 0.379
3.314LysSer: 3.314 ± 0.562
2.946LysThr: 2.946 ± 0.968
4.418LysVal: 4.418 ± 1.151
0.368LysTrp: 0.368 ± 0.196
2.577LysTyr: 2.577 ± 0.432
0.0LysXaa: 0.0 ± 0.0
Leu
6.996LeuAla: 6.996 ± 2.525
2.209LeuCys: 2.209 ± 0.576
7.732LeuAsp: 7.732 ± 0.092
4.05LeuGlu: 4.05 ± 0.955
2.946LeuPhe: 2.946 ± 0.236
4.786LeuGly: 4.786 ± 1.06
3.314LeuHis: 3.314 ± 1.164
4.418LeuIle: 4.418 ± 0.053
4.786LeuLys: 4.786 ± 1.949
4.05LeuLeu: 4.05 ± 1.557
1.473LeuMet: 1.473 ± 0.785
3.314LeuAsn: 3.314 ± 0.641
1.841LeuPro: 1.841 ± 0.379
1.841LeuGln: 1.841 ± 0.379
5.523LeuArg: 5.523 ± 0.066
3.682LeuSer: 3.682 ± 0.157
7.732LeuThr: 7.732 ± 0.51
5.523LeuVal: 5.523 ± 1.138
1.105LeuTrp: 1.105 ± 0.589
2.577LeuTyr: 2.577 ± 0.432
0.0LeuXaa: 0.0 ± 0.0
Met
2.577MetAla: 2.577 ± 0.17
0.0MetCys: 0.0 ± 0.0
1.841MetAsp: 1.841 ± 1.426
1.841MetGlu: 1.841 ± 0.981
0.368MetPhe: 0.368 ± 0.406
2.209MetGly: 2.209 ± 0.628
0.0MetHis: 0.0 ± 0.0
0.736MetIle: 0.736 ± 0.392
1.473MetLys: 1.473 ± 0.785
1.473MetLeu: 1.473 ± 0.785
1.473MetMet: 1.473 ± 0.183
0.0MetAsn: 0.0 ± 0.0
1.841MetPro: 1.841 ± 0.824
1.105MetGln: 1.105 ± 0.589
4.05MetArg: 4.05 ± 0.353
0.368MetSer: 0.368 ± 0.406
1.105MetThr: 1.105 ± 0.589
1.105MetVal: 1.105 ± 0.589
1.473MetTrp: 1.473 ± 0.183
1.105MetTyr: 1.105 ± 0.615
0.0MetXaa: 0.0 ± 0.0
Asn
4.786AsnAla: 4.786 ± 1.662
1.105AsnCys: 1.105 ± 0.615
1.841AsnAsp: 1.841 ± 0.824
1.841AsnGlu: 1.841 ± 0.824
0.736AsnPhe: 0.736 ± 0.392
3.314AsnGly: 3.314 ± 1.243
0.736AsnHis: 0.736 ± 0.392
1.105AsnIle: 1.105 ± 0.013
1.105AsnLys: 1.105 ± 0.013
4.418AsnLeu: 4.418 ± 1.858
2.577AsnMet: 2.577 ± 0.152
2.209AsnAsn: 2.209 ± 0.628
0.736AsnPro: 0.736 ± 0.209
1.473AsnGln: 1.473 ± 0.419
1.473AsnArg: 1.473 ± 0.183
2.946AsnSer: 2.946 ± 0.968
1.105AsnThr: 1.105 ± 0.589
5.155AsnVal: 5.155 ± 1.466
0.736AsnTrp: 0.736 ± 0.209
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.946ProAla: 2.946 ± 0.838
0.0ProCys: 0.0 ± 0.0
2.577ProAsp: 2.577 ± 1.034
3.314ProGlu: 3.314 ± 1.164
1.841ProPhe: 1.841 ± 0.824
2.946ProGly: 2.946 ± 0.838
0.368ProHis: 0.368 ± 0.196
4.418ProIle: 4.418 ± 1.858
1.473ProLys: 1.473 ± 0.183
7.732ProLeu: 7.732 ± 0.51
1.473ProMet: 1.473 ± 0.419
1.473ProAsn: 1.473 ± 0.419
2.577ProPro: 2.577 ± 1.034
0.736ProGln: 0.736 ± 0.392
1.473ProArg: 1.473 ± 0.183
3.682ProSer: 3.682 ± 0.445
3.682ProThr: 3.682 ± 1.047
4.786ProVal: 4.786 ± 1.662
0.736ProTrp: 0.736 ± 0.811
3.682ProTyr: 3.682 ± 1.649
0.0ProXaa: 0.0 ± 0.0
Gln
1.473GlnAla: 1.473 ± 0.785
0.368GlnCys: 0.368 ± 0.196
1.105GlnAsp: 1.105 ± 0.013
2.946GlnGlu: 2.946 ± 0.968
0.0GlnPhe: 0.0 ± 0.0
2.577GlnGly: 2.577 ± 0.17
0.368GlnHis: 0.368 ± 0.196
2.577GlnIle: 2.577 ± 0.772
2.209GlnLys: 2.209 ± 1.177
4.786GlnLeu: 4.786 ± 1.347
0.736GlnMet: 0.736 ± 0.392
0.368GlnAsn: 0.368 ± 0.406
1.841GlnPro: 1.841 ± 0.223
1.105GlnGln: 1.105 ± 0.013
0.736GlnArg: 0.736 ± 0.392
1.473GlnSer: 1.473 ± 1.021
1.105GlnThr: 1.105 ± 0.615
2.946GlnVal: 2.946 ± 0.838
1.473GlnTrp: 1.473 ± 0.419
1.841GlnTyr: 1.841 ± 0.379
0.0GlnXaa: 0.0 ± 0.0
Arg
2.946ArgAla: 2.946 ± 0.236
0.736ArgCys: 0.736 ± 0.811
2.577ArgAsp: 2.577 ± 0.17
3.682ArgGlu: 3.682 ± 1.962
4.786ArgPhe: 4.786 ± 0.745
4.05ArgGly: 4.05 ± 0.353
1.473ArgHis: 1.473 ± 0.183
3.314ArgIle: 3.314 ± 0.04
2.577ArgLys: 2.577 ± 1.374
4.786ArgLeu: 4.786 ± 0.745
0.0ArgMet: 0.0 ± 0.0
1.473ArgAsn: 1.473 ± 0.183
2.946ArgPro: 2.946 ± 1.44
1.105ArgGln: 1.105 ± 0.013
4.786ArgArg: 4.786 ± 1.06
3.682ArgSer: 3.682 ± 0.445
4.418ArgThr: 4.418 ± 2.355
4.418ArgVal: 4.418 ± 1.753
1.105ArgTrp: 1.105 ± 0.013
1.473ArgTyr: 1.473 ± 0.183
0.0ArgXaa: 0.0 ± 0.0
Ser
3.314SerAla: 3.314 ± 0.641
0.368SerCys: 0.368 ± 0.406
2.209SerAsp: 2.209 ± 0.628
3.682SerGlu: 3.682 ± 1.047
1.105SerPhe: 1.105 ± 0.013
7.732SerGly: 7.732 ± 0.092
1.841SerHis: 1.841 ± 0.223
2.209SerIle: 2.209 ± 1.177
3.682SerLys: 3.682 ± 0.445
5.523SerLeu: 5.523 ± 0.536
1.473SerMet: 1.473 ± 0.419
3.682SerAsn: 3.682 ± 1.047
2.946SerPro: 2.946 ± 0.838
1.473SerGln: 1.473 ± 0.785
1.473SerArg: 1.473 ± 0.419
2.577SerSer: 2.577 ± 1.034
7.364SerThr: 7.364 ± 2.696
4.05SerVal: 4.05 ± 0.851
0.368SerTrp: 0.368 ± 0.406
2.209SerTyr: 2.209 ± 0.576
0.0SerXaa: 0.0 ± 0.0
Thr
5.523ThrAla: 5.523 ± 0.066
0.368ThrCys: 0.368 ± 0.196
3.682ThrAsp: 3.682 ± 0.759
3.314ThrGlu: 3.314 ± 0.562
1.841ThrPhe: 1.841 ± 0.824
5.523ThrGly: 5.523 ± 0.668
1.473ThrHis: 1.473 ± 0.785
4.418ThrIle: 4.418 ± 0.053
3.314ThrLys: 3.314 ± 1.766
4.05ThrLeu: 4.05 ± 1.557
2.209ThrMet: 2.209 ± 0.576
3.682ThrAsn: 3.682 ± 2.251
2.946ThrPro: 2.946 ± 0.236
1.473ThrGln: 1.473 ± 0.785
5.523ThrArg: 5.523 ± 1.138
5.155ThrSer: 5.155 ± 2.67
3.314ThrThr: 3.314 ± 1.845
6.996ThrVal: 6.996 ± 0.485
0.0ThrTrp: 0.0 ± 0.0
2.577ThrTyr: 2.577 ± 1.034
0.0ThrXaa: 0.0 ± 0.0
Val
7.732ValAla: 7.732 ± 0.092
1.105ValCys: 1.105 ± 0.589
4.05ValAsp: 4.05 ± 0.851
7.364ValGlu: 7.364 ± 1.517
1.841ValPhe: 1.841 ± 0.223
6.259ValGly: 6.259 ± 0.877
2.946ValHis: 2.946 ± 0.968
3.314ValIle: 3.314 ± 1.164
3.682ValLys: 3.682 ± 0.157
4.786ValLeu: 4.786 ± 0.143
0.736ValMet: 0.736 ± 0.392
3.682ValAsn: 3.682 ± 0.759
4.786ValPro: 4.786 ± 2.866
2.946ValGln: 2.946 ± 0.838
3.682ValArg: 3.682 ± 0.759
5.523ValSer: 5.523 ± 1.27
1.473ValThr: 1.473 ± 0.183
5.523ValVal: 5.523 ± 1.138
1.841ValTrp: 1.841 ± 0.824
2.577ValTyr: 2.577 ± 0.772
0.0ValXaa: 0.0 ± 0.0
Trp
0.368TrpAla: 0.368 ± 0.196
0.368TrpCys: 0.368 ± 0.196
0.736TrpAsp: 0.736 ± 0.392
0.368TrpGlu: 0.368 ± 0.406
0.736TrpPhe: 0.736 ± 0.209
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.473TrpIle: 1.473 ± 0.419
1.473TrpLys: 1.473 ± 0.419
2.577TrpLeu: 2.577 ± 0.17
0.736TrpMet: 0.736 ± 0.209
0.736TrpAsn: 0.736 ± 0.209
0.368TrpPro: 0.368 ± 0.196
0.736TrpGln: 0.736 ± 0.209
0.736TrpArg: 0.736 ± 0.811
1.473TrpSer: 1.473 ± 0.183
2.577TrpThr: 2.577 ± 0.17
0.736TrpVal: 0.736 ± 0.209
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.577TyrAla: 2.577 ± 2.238
1.105TyrCys: 1.105 ± 0.013
2.209TyrAsp: 2.209 ± 0.628
1.105TyrGlu: 1.105 ± 0.013
2.946TyrPhe: 2.946 ± 0.838
2.209TyrGly: 2.209 ± 0.576
0.736TyrHis: 0.736 ± 0.209
1.841TyrIle: 1.841 ± 0.824
2.577TyrLys: 2.577 ± 0.17
2.577TyrLeu: 2.577 ± 0.432
1.105TyrMet: 1.105 ± 0.589
2.209TyrAsn: 2.209 ± 1.23
1.841TyrPro: 1.841 ± 0.223
0.736TyrGln: 0.736 ± 0.392
2.577TyrArg: 2.577 ± 0.17
1.473TyrSer: 1.473 ± 1.021
1.473TyrThr: 1.473 ± 0.183
1.473TyrVal: 1.473 ± 1.021
0.736TyrTrp: 0.736 ± 0.209
1.105TyrTyr: 1.105 ± 1.217
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2717 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski