Amino acid dipepetide frequency for Hubei picorna-like virus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.114AlaAla: 5.114 ± 0.102
0.787AlaCys: 0.787 ± 0.373
3.541AlaAsp: 3.541 ± 0.216
2.754AlaGlu: 2.754 ± 0.042
3.934AlaPhe: 3.934 ± 1.863
4.327AlaGly: 4.327 ± 2.999
2.36AlaHis: 2.36 ± 0.144
3.934AlaIle: 3.934 ± 0.601
3.541AlaLys: 3.541 ± 0.415
7.081AlaLeu: 7.081 ± 0.433
1.18AlaMet: 1.18 ± 0.936
2.754AlaAsn: 2.754 ± 1.22
1.18AlaPro: 1.18 ± 0.703
0.393AlaGln: 0.393 ± 0.186
3.541AlaArg: 3.541 ± 0.216
6.688AlaSer: 6.688 ± 0.643
3.934AlaThr: 3.934 ± 1.292
5.507AlaVal: 5.507 ± 0.547
0.0AlaTrp: 0.0 ± 0.0
1.574AlaTyr: 1.574 ± 0.114
0.0AlaXaa: 0.0 ± 0.0
Cys
1.967CysAla: 1.967 ± 0.932
0.393CysCys: 0.393 ± 0.186
1.18CysAsp: 1.18 ± 0.559
0.0CysGlu: 0.0 ± 0.0
1.574CysPhe: 1.574 ± 0.114
1.574CysGly: 1.574 ± 0.745
0.0CysHis: 0.0 ± 0.0
0.393CysIle: 0.393 ± 0.186
1.18CysLys: 1.18 ± 0.072
0.787CysLeu: 0.787 ± 0.373
0.0CysMet: 0.0 ± 0.0
0.787CysAsn: 0.787 ± 0.889
0.787CysPro: 0.787 ± 0.258
0.0CysGln: 0.0 ± 0.0
0.393CysArg: 0.393 ± 0.186
1.574CysSer: 1.574 ± 0.517
0.787CysThr: 0.787 ± 0.373
1.967CysVal: 1.967 ± 0.301
0.393CysTrp: 0.393 ± 0.186
1.18CysTyr: 1.18 ± 0.072
0.0CysXaa: 0.0 ± 0.0
Asp
2.754AspAla: 2.754 ± 1.304
1.18AspCys: 1.18 ± 0.559
2.36AspAsp: 2.36 ± 1.118
1.967AspGlu: 1.967 ± 0.932
5.114AspPhe: 5.114 ± 1.16
2.36AspGly: 2.36 ± 0.487
1.574AspHis: 1.574 ± 0.114
3.147AspIle: 3.147 ± 0.228
1.574AspLys: 1.574 ± 0.114
2.754AspLeu: 2.754 ± 0.042
0.787AspMet: 0.787 ± 0.316
1.18AspAsn: 1.18 ± 0.559
2.754AspPro: 2.754 ± 0.673
1.574AspGln: 1.574 ± 1.148
2.754AspArg: 2.754 ± 0.042
3.934AspSer: 3.934 ± 0.661
2.754AspThr: 2.754 ± 1.22
2.36AspVal: 2.36 ± 0.144
0.787AspTrp: 0.787 ± 0.373
2.36AspTyr: 2.36 ± 0.775
0.0AspXaa: 0.0 ± 0.0
Glu
1.574GluAla: 1.574 ± 0.114
1.18GluCys: 1.18 ± 0.559
4.327GluAsp: 4.327 ± 0.475
2.36GluGlu: 2.36 ± 1.118
4.721GluPhe: 4.721 ± 0.288
2.754GluGly: 2.754 ± 0.673
1.574GluHis: 1.574 ± 0.745
0.787GluIle: 0.787 ± 0.373
4.327GluLys: 4.327 ± 0.787
4.721GluLeu: 4.721 ± 0.288
1.967GluMet: 1.967 ± 0.331
2.754GluAsn: 2.754 ± 0.673
1.967GluPro: 1.967 ± 0.301
3.934GluGln: 3.934 ± 0.601
2.36GluArg: 2.36 ± 1.118
4.721GluSer: 4.721 ± 0.974
2.36GluThr: 2.36 ± 0.144
6.294GluVal: 6.294 ± 0.457
0.787GluTrp: 0.787 ± 0.373
1.574GluTyr: 1.574 ± 0.745
0.0GluXaa: 0.0 ± 0.0
Phe
3.541PheAla: 3.541 ± 0.216
1.18PheCys: 1.18 ± 0.072
3.541PheAsp: 3.541 ± 0.415
3.934PheGlu: 3.934 ± 0.03
2.754PhePhe: 2.754 ± 1.304
4.721PheGly: 4.721 ± 0.974
0.787PheHis: 0.787 ± 0.258
1.18PheIle: 1.18 ± 0.559
1.574PheLys: 1.574 ± 0.745
5.901PheLeu: 5.901 ± 0.902
1.967PheMet: 1.967 ± 0.331
4.721PheAsn: 4.721 ± 1.605
2.754PhePro: 2.754 ± 0.673
1.18PheGln: 1.18 ± 0.559
1.967PheArg: 1.967 ± 0.331
7.081PheSer: 7.081 ± 2.092
2.36PheThr: 2.36 ± 0.144
3.147PheVal: 3.147 ± 0.228
0.393PheTrp: 0.393 ± 0.186
4.327PheTyr: 4.327 ± 1.106
0.0PheXaa: 0.0 ± 0.0
Gly
5.901GlyAla: 5.901 ± 2.254
1.18GlyCys: 1.18 ± 0.072
2.754GlyAsp: 2.754 ± 0.042
3.934GlyGlu: 3.934 ± 0.03
1.967GlyPhe: 1.967 ± 0.301
3.934GlyGly: 3.934 ± 0.03
1.574GlyHis: 1.574 ± 0.745
3.541GlyIle: 3.541 ± 0.216
5.507GlyLys: 5.507 ± 1.977
5.901GlyLeu: 5.901 ± 1.533
1.574GlyMet: 1.574 ± 0.517
0.393GlyAsn: 0.393 ± 0.186
2.36GlyPro: 2.36 ± 0.775
0.787GlyGln: 0.787 ± 0.373
3.147GlyArg: 3.147 ± 1.034
5.114GlySer: 5.114 ± 1.995
5.114GlyThr: 5.114 ± 1.995
5.901GlyVal: 5.901 ± 0.902
0.787GlyTrp: 0.787 ± 0.373
3.934GlyTyr: 3.934 ± 1.292
0.0GlyXaa: 0.0 ± 0.0
His
1.18HisAla: 1.18 ± 0.072
0.393HisCys: 0.393 ± 0.445
1.18HisAsp: 1.18 ± 1.334
0.393HisGlu: 0.393 ± 0.186
1.18HisPhe: 1.18 ± 0.072
1.574HisGly: 1.574 ± 0.114
0.0HisHis: 0.0 ± 0.0
1.574HisIle: 1.574 ± 0.745
1.967HisLys: 1.967 ± 0.301
1.574HisLeu: 1.574 ± 0.745
0.0HisMet: 0.0 ± 0.0
1.18HisAsn: 1.18 ± 0.559
1.18HisPro: 1.18 ± 0.559
1.574HisGln: 1.574 ± 0.745
1.18HisArg: 1.18 ± 0.072
1.18HisSer: 1.18 ± 0.072
0.0HisThr: 0.0 ± 0.0
1.967HisVal: 1.967 ± 0.331
0.393HisTrp: 0.393 ± 0.186
0.393HisTyr: 0.393 ± 0.445
0.0HisXaa: 0.0 ± 0.0
Ile
4.721IleAla: 4.721 ± 0.343
0.393IleCys: 0.393 ± 0.186
1.574IleAsp: 1.574 ± 0.517
1.574IleGlu: 1.574 ± 0.114
1.967IlePhe: 1.967 ± 0.301
3.147IleGly: 3.147 ± 0.859
1.574IleHis: 1.574 ± 0.745
2.36IleIle: 2.36 ± 1.118
1.574IleLys: 1.574 ± 0.114
4.327IleLeu: 4.327 ± 1.418
1.18IleMet: 1.18 ± 0.559
2.754IleAsn: 2.754 ± 0.589
4.327IlePro: 4.327 ± 1.737
1.574IleGln: 1.574 ± 0.114
3.541IleArg: 3.541 ± 0.216
3.934IleSer: 3.934 ± 0.661
3.147IleThr: 3.147 ± 1.034
4.327IleVal: 4.327 ± 0.156
1.18IleTrp: 1.18 ± 0.072
1.574IleTyr: 1.574 ± 0.517
0.0IleXaa: 0.0 ± 0.0
Lys
3.541LysAla: 3.541 ± 1.046
0.0LysCys: 0.0 ± 0.0
2.754LysAsp: 2.754 ± 1.304
4.327LysGlu: 4.327 ± 0.787
4.327LysPhe: 4.327 ± 2.049
2.754LysGly: 2.754 ± 0.673
0.787LysHis: 0.787 ± 0.258
3.147LysIle: 3.147 ± 0.403
4.327LysLys: 4.327 ± 2.049
3.541LysLeu: 3.541 ± 0.216
1.18LysMet: 1.18 ± 0.559
1.574LysAsn: 1.574 ± 0.745
2.754LysPro: 2.754 ± 0.673
1.574LysGln: 1.574 ± 0.517
3.934LysArg: 3.934 ± 0.601
3.147LysSer: 3.147 ± 1.491
4.327LysThr: 4.327 ± 0.156
3.147LysVal: 3.147 ± 1.034
1.18LysTrp: 1.18 ± 0.072
1.574LysTyr: 1.574 ± 0.114
0.0LysXaa: 0.0 ± 0.0
Leu
6.294LeuAla: 6.294 ± 0.805
2.36LeuCys: 2.36 ± 0.487
3.934LeuAsp: 3.934 ± 1.232
8.261LeuGlu: 8.261 ± 0.126
5.114LeuPhe: 5.114 ± 1.791
6.294LeuGly: 6.294 ± 0.805
1.18LeuHis: 1.18 ± 0.559
3.147LeuIle: 3.147 ± 0.859
5.114LeuLys: 5.114 ± 0.733
9.048LeuLeu: 9.048 ± 0.499
1.18LeuMet: 1.18 ± 0.559
6.294LeuAsn: 6.294 ± 2.698
7.081LeuPro: 7.081 ± 1.064
2.754LeuGln: 2.754 ± 0.673
3.541LeuArg: 3.541 ± 0.216
6.294LeuSer: 6.294 ± 0.174
5.901LeuThr: 5.901 ± 0.361
4.721LeuVal: 4.721 ± 0.343
0.393LeuTrp: 0.393 ± 0.186
4.327LeuTyr: 4.327 ± 1.418
0.0LeuXaa: 0.0 ± 0.0
Met
1.574MetAla: 1.574 ± 0.114
0.393MetCys: 0.393 ± 0.186
1.18MetAsp: 1.18 ± 0.703
1.967MetGlu: 1.967 ± 0.301
0.787MetPhe: 0.787 ± 0.373
1.574MetGly: 1.574 ± 0.114
1.18MetHis: 1.18 ± 0.703
0.787MetIle: 0.787 ± 0.373
0.393MetLys: 0.393 ± 0.186
2.754MetLeu: 2.754 ± 1.304
0.787MetMet: 0.787 ± 0.258
0.787MetAsn: 0.787 ± 0.889
0.393MetPro: 0.393 ± 0.186
1.967MetGln: 1.967 ± 0.932
0.787MetArg: 0.787 ± 0.373
2.754MetSer: 2.754 ± 3.113
0.787MetThr: 0.787 ± 0.889
0.393MetVal: 0.393 ± 0.186
0.393MetTrp: 0.393 ± 0.186
0.393MetTyr: 0.393 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
1.574AsnAla: 1.574 ± 0.114
0.393AsnCys: 0.393 ± 0.186
1.18AsnAsp: 1.18 ± 0.559
1.18AsnGlu: 1.18 ± 0.072
2.754AsnPhe: 2.754 ± 0.673
3.934AsnGly: 3.934 ± 1.292
0.787AsnHis: 0.787 ± 0.258
2.36AsnIle: 2.36 ± 0.775
2.36AsnLys: 2.36 ± 1.118
6.294AsnLeu: 6.294 ± 1.436
0.787AsnMet: 0.787 ± 0.373
2.36AsnAsn: 2.36 ± 0.775
2.754AsnPro: 2.754 ± 0.673
2.36AsnGln: 2.36 ± 1.406
1.967AsnArg: 1.967 ± 0.331
5.507AsnSer: 5.507 ± 0.715
2.754AsnThr: 2.754 ± 1.851
4.721AsnVal: 4.721 ± 0.288
0.393AsnTrp: 0.393 ± 0.445
1.574AsnTyr: 1.574 ± 0.745
0.0AsnXaa: 0.0 ± 0.0
Pro
1.18ProAla: 1.18 ± 1.334
0.787ProCys: 0.787 ± 0.258
1.574ProAsp: 1.574 ± 0.745
3.541ProGlu: 3.541 ± 1.046
6.294ProPhe: 6.294 ± 0.174
3.934ProGly: 3.934 ± 2.554
0.0ProHis: 0.0 ± 0.0
4.721ProIle: 4.721 ± 1.551
2.36ProLys: 2.36 ± 1.118
5.507ProLeu: 5.507 ± 1.346
0.787ProMet: 0.787 ± 0.258
2.36ProAsn: 2.36 ± 1.406
1.967ProPro: 1.967 ± 0.331
1.967ProGln: 1.967 ± 0.301
1.574ProArg: 1.574 ± 0.745
2.754ProSer: 2.754 ± 0.673
3.541ProThr: 3.541 ± 0.847
3.934ProVal: 3.934 ± 1.923
0.393ProTrp: 0.393 ± 0.186
3.541ProTyr: 3.541 ± 1.478
0.0ProXaa: 0.0 ± 0.0
Gln
1.967GlnAla: 1.967 ± 0.301
0.0GlnCys: 0.0 ± 0.0
0.787GlnAsp: 0.787 ± 0.258
3.541GlnGlu: 3.541 ± 1.677
1.574GlnPhe: 1.574 ± 0.517
2.36GlnGly: 2.36 ± 0.487
0.787GlnHis: 0.787 ± 0.258
0.393GlnIle: 0.393 ± 0.445
1.574GlnLys: 1.574 ± 0.114
4.327GlnLeu: 4.327 ± 0.156
1.18GlnMet: 1.18 ± 0.703
1.18GlnAsn: 1.18 ± 0.559
1.574GlnPro: 1.574 ± 0.114
0.393GlnGln: 0.393 ± 0.186
0.787GlnArg: 0.787 ± 0.258
1.574GlnSer: 1.574 ± 0.517
2.754GlnThr: 2.754 ± 0.673
1.574GlnVal: 1.574 ± 0.114
0.787GlnTrp: 0.787 ± 0.373
1.18GlnTyr: 1.18 ± 0.072
0.0GlnXaa: 0.0 ± 0.0
Arg
3.541ArgAla: 3.541 ± 0.415
0.393ArgCys: 0.393 ± 0.186
1.574ArgAsp: 1.574 ± 0.745
3.147ArgGlu: 3.147 ± 0.228
2.36ArgPhe: 2.36 ± 0.775
2.36ArgGly: 2.36 ± 0.144
0.787ArgHis: 0.787 ± 0.373
3.934ArgIle: 3.934 ± 0.661
3.147ArgLys: 3.147 ± 0.228
4.327ArgLeu: 4.327 ± 0.475
0.393ArgMet: 0.393 ± 0.186
1.967ArgAsn: 1.967 ± 0.932
5.114ArgPro: 5.114 ± 1.364
0.393ArgGln: 0.393 ± 0.186
0.787ArgArg: 0.787 ± 0.373
2.754ArgSer: 2.754 ± 0.673
3.147ArgThr: 3.147 ± 0.859
1.574ArgVal: 1.574 ± 0.745
0.0ArgTrp: 0.0 ± 0.0
2.36ArgTyr: 2.36 ± 0.775
0.0ArgXaa: 0.0 ± 0.0
Ser
5.114SerAla: 5.114 ± 0.102
1.18SerCys: 1.18 ± 0.559
3.934SerAsp: 3.934 ± 0.601
3.147SerGlu: 3.147 ± 0.859
3.147SerPhe: 3.147 ± 0.403
4.327SerGly: 4.327 ± 0.475
1.967SerHis: 1.967 ± 0.331
5.114SerIle: 5.114 ± 1.16
3.934SerLys: 3.934 ± 0.03
11.802SerLeu: 11.802 ± 3.245
3.147SerMet: 3.147 ± 0.403
4.327SerAsn: 4.327 ± 0.787
3.541SerPro: 3.541 ± 0.415
4.327SerGln: 4.327 ± 0.475
3.147SerArg: 3.147 ± 0.228
7.474SerSer: 7.474 ± 0.246
3.541SerThr: 3.541 ± 1.478
6.688SerVal: 6.688 ± 0.012
0.0SerTrp: 0.0 ± 0.0
3.147SerTyr: 3.147 ± 1.034
0.0SerXaa: 0.0 ± 0.0
Thr
1.967ThrAla: 1.967 ± 0.962
1.574ThrCys: 1.574 ± 0.517
1.967ThrAsp: 1.967 ± 0.301
3.147ThrGlu: 3.147 ± 0.228
2.36ThrPhe: 2.36 ± 0.487
3.541ThrGly: 3.541 ± 0.847
1.18ThrHis: 1.18 ± 0.703
4.327ThrIle: 4.327 ± 1.106
3.934ThrLys: 3.934 ± 0.601
4.721ThrLeu: 4.721 ± 1.551
1.18ThrMet: 1.18 ± 0.559
2.754ThrAsn: 2.754 ± 0.589
4.327ThrPro: 4.327 ± 3.63
0.787ThrGln: 0.787 ± 0.258
2.36ThrArg: 2.36 ± 0.487
5.114ThrSer: 5.114 ± 0.529
5.114ThrThr: 5.114 ± 1.364
5.901ThrVal: 5.901 ± 0.361
1.967ThrTrp: 1.967 ± 0.962
4.327ThrTyr: 4.327 ± 1.737
0.0ThrXaa: 0.0 ± 0.0
Val
7.868ValAla: 7.868 ± 1.953
0.393ValCys: 0.393 ± 0.186
4.327ValAsp: 4.327 ± 0.787
5.507ValGlu: 5.507 ± 0.084
3.541ValPhe: 3.541 ± 0.847
4.721ValGly: 4.721 ± 0.974
0.787ValHis: 0.787 ± 0.373
3.934ValIle: 3.934 ± 0.03
3.934ValLys: 3.934 ± 0.601
3.147ValLeu: 3.147 ± 0.859
1.574ValMet: 1.574 ± 0.517
3.934ValAsn: 3.934 ± 1.292
3.147ValPro: 3.147 ± 0.228
2.36ValGln: 2.36 ± 0.487
3.934ValArg: 3.934 ± 0.601
6.294ValSer: 6.294 ± 1.436
5.901ValThr: 5.901 ± 0.361
4.327ValVal: 4.327 ± 2.049
0.393ValTrp: 0.393 ± 0.186
2.754ValTyr: 2.754 ± 0.589
0.0ValXaa: 0.0 ± 0.0
Trp
0.393TrpAla: 0.393 ± 0.445
0.787TrpCys: 0.787 ± 0.373
1.18TrpAsp: 1.18 ± 0.072
0.393TrpGlu: 0.393 ± 0.186
0.787TrpPhe: 0.787 ± 0.373
0.787TrpGly: 0.787 ± 0.258
0.393TrpHis: 0.393 ± 0.186
0.787TrpIle: 0.787 ± 0.258
0.0TrpLys: 0.0 ± 0.0
1.18TrpLeu: 1.18 ± 0.559
0.0TrpMet: 0.0 ± 0.0
1.574TrpAsn: 1.574 ± 0.114
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.393TrpArg: 0.393 ± 0.186
1.18TrpSer: 1.18 ± 0.072
1.574TrpThr: 1.574 ± 0.745
0.0TrpVal: 0.0 ± 0.0
0.787TrpTrp: 0.787 ± 0.373
0.787TrpTyr: 0.787 ± 0.258
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.754TyrAla: 2.754 ± 1.22
1.967TyrCys: 1.967 ± 0.331
1.18TyrAsp: 1.18 ± 0.703
1.967TyrGlu: 1.967 ± 0.301
2.754TyrPhe: 2.754 ± 0.042
4.327TyrGly: 4.327 ± 0.475
0.787TyrHis: 0.787 ± 0.373
1.18TyrIle: 1.18 ± 0.072
1.574TyrLys: 1.574 ± 0.114
3.541TyrLeu: 3.541 ± 0.216
0.787TyrMet: 0.787 ± 0.258
2.36TyrAsn: 2.36 ± 0.144
2.754TyrPro: 2.754 ± 0.589
0.393TyrGln: 0.393 ± 0.186
1.967TyrArg: 1.967 ± 0.331
3.934TyrSer: 3.934 ± 1.923
2.754TyrThr: 2.754 ± 1.22
4.327TyrVal: 4.327 ± 0.475
1.574TyrTrp: 1.574 ± 0.114
2.36TyrTyr: 2.36 ± 0.144
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2543 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski