Amino acid dipepetide frequency for Hubei picorna-like virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.688AlaAla: 6.688 ± 2.034
1.574AlaCys: 1.574 ± 0.949
4.721AlaAsp: 4.721 ± 1.149
3.147AlaGlu: 3.147 ± 0.1
3.147AlaPhe: 3.147 ± 0.1
3.934AlaGly: 3.934 ± 0.291
1.574AlaHis: 1.574 ± 0.283
4.327AlaIle: 4.327 ± 0.054
3.541AlaLys: 3.541 ± 1.469
5.114AlaLeu: 5.114 ± 2.417
1.967AlaMet: 1.967 ± 1.186
6.688AlaAsn: 6.688 ± 3.292
5.901AlaPro: 5.901 ± 0.437
1.574AlaGln: 1.574 ± 0.283
3.934AlaArg: 3.934 ± 0.291
6.294AlaSer: 6.294 ± 0.2
4.327AlaThr: 4.327 ± 1.386
5.114AlaVal: 5.114 ± 0.246
2.754AlaTrp: 2.754 ± 1.003
3.147AlaTyr: 3.147 ± 0.566
0.0AlaXaa: 0.0 ± 0.0
Cys
0.787CysAla: 0.787 ± 0.474
0.787CysCys: 0.787 ± 0.857
0.787CysAsp: 0.787 ± 0.474
1.574CysGlu: 1.574 ± 0.949
0.0CysPhe: 0.0 ± 0.0
0.787CysGly: 0.787 ± 0.191
0.0CysHis: 0.0 ± 0.0
1.18CysIle: 1.18 ± 0.711
1.18CysLys: 1.18 ± 0.711
0.787CysLeu: 0.787 ± 0.191
0.0CysMet: 0.0 ± 0.0
0.393CysAsn: 0.393 ± 0.237
1.18CysPro: 1.18 ± 0.046
0.393CysGln: 0.393 ± 0.237
0.393CysArg: 0.393 ± 0.237
0.787CysSer: 0.787 ± 0.474
0.393CysThr: 0.393 ± 0.237
1.967CysVal: 1.967 ± 0.52
0.393CysTrp: 0.393 ± 0.237
1.967CysTyr: 1.967 ± 0.811
0.0CysXaa: 0.0 ± 0.0
Asp
4.327AspAla: 4.327 ± 1.943
1.18AspCys: 1.18 ± 0.046
4.327AspAsp: 4.327 ± 0.611
1.967AspGlu: 1.967 ± 0.146
3.541AspPhe: 3.541 ± 0.529
2.36AspGly: 2.36 ± 0.091
1.967AspHis: 1.967 ± 0.52
3.934AspIle: 3.934 ± 1.04
1.967AspLys: 1.967 ± 0.52
5.507AspLeu: 5.507 ± 1.34
0.393AspMet: 0.393 ± 0.237
2.36AspAsn: 2.36 ± 1.906
2.754AspPro: 2.754 ± 1.669
1.574AspGln: 1.574 ± 0.949
2.36AspArg: 2.36 ± 0.757
3.147AspSer: 3.147 ± 0.766
2.754AspThr: 2.754 ± 0.337
5.114AspVal: 5.114 ± 0.246
0.787AspTrp: 0.787 ± 0.474
1.574AspTyr: 1.574 ± 0.283
0.0AspXaa: 0.0 ± 0.0
Glu
6.294GluAla: 6.294 ± 3.129
0.787GluCys: 0.787 ± 0.474
3.147GluAsp: 3.147 ± 0.766
5.507GluGlu: 5.507 ± 1.323
1.967GluPhe: 1.967 ± 0.146
2.36GluGly: 2.36 ± 0.574
1.967GluHis: 1.967 ± 1.186
3.147GluIle: 3.147 ± 0.766
2.36GluLys: 2.36 ± 0.757
5.901GluLeu: 5.901 ± 0.229
1.574GluMet: 1.574 ± 0.283
0.787GluAsn: 0.787 ± 0.474
1.574GluPro: 1.574 ± 0.283
1.18GluGln: 1.18 ± 0.711
2.754GluArg: 2.754 ± 0.994
2.36GluSer: 2.36 ± 0.091
1.967GluThr: 1.967 ± 1.186
3.541GluVal: 3.541 ± 2.134
0.393GluTrp: 0.393 ± 0.237
2.754GluTyr: 2.754 ± 0.994
0.0GluXaa: 0.0 ± 0.0
Phe
3.541PheAla: 3.541 ± 0.529
0.393PheCys: 0.393 ± 0.429
2.754PheAsp: 2.754 ± 0.994
0.787PheGlu: 0.787 ± 0.474
3.541PhePhe: 3.541 ± 1.194
3.147PheGly: 3.147 ± 0.766
1.574PheHis: 1.574 ± 0.283
1.574PheIle: 1.574 ± 0.283
3.147PheLys: 3.147 ± 0.566
5.507PheLeu: 5.507 ± 0.009
0.0PheMet: 0.0 ± 0.0
1.18PheAsn: 1.18 ± 0.711
1.967PhePro: 1.967 ± 1.186
1.18PheGln: 1.18 ± 0.046
1.18PheArg: 1.18 ± 0.046
2.36PheSer: 2.36 ± 0.574
4.721PheThr: 4.721 ± 1.149
3.541PheVal: 3.541 ± 1.194
1.18PheTrp: 1.18 ± 0.711
1.967PheTyr: 1.967 ± 0.52
0.0PheXaa: 0.0 ± 0.0
Gly
2.36GlyAla: 2.36 ± 0.574
0.393GlyCys: 0.393 ± 0.237
4.327GlyAsp: 4.327 ± 0.611
4.721GlyGlu: 4.721 ± 0.483
2.754GlyPhe: 2.754 ± 0.337
2.754GlyGly: 2.754 ± 1.003
1.18GlyHis: 1.18 ± 0.046
2.36GlyIle: 2.36 ± 0.574
3.147GlyLys: 3.147 ± 1.231
2.754GlyLeu: 2.754 ± 1.003
1.18GlyMet: 1.18 ± 0.046
1.967GlyAsn: 1.967 ± 0.146
2.36GlyPro: 2.36 ± 1.24
3.541GlyGln: 3.541 ± 0.529
1.967GlyArg: 1.967 ± 0.146
5.507GlySer: 5.507 ± 3.337
6.688GlyThr: 6.688 ± 1.96
7.081GlyVal: 7.081 ± 1.606
0.787GlyTrp: 0.787 ± 0.474
3.147GlyTyr: 3.147 ± 1.432
0.0GlyXaa: 0.0 ± 0.0
His
2.36HisAla: 2.36 ± 0.091
0.787HisCys: 0.787 ± 0.474
1.18HisAsp: 1.18 ± 0.62
0.787HisGlu: 0.787 ± 0.474
1.574HisPhe: 1.574 ± 0.283
4.721HisGly: 4.721 ± 1.514
0.787HisHis: 0.787 ± 0.474
1.18HisIle: 1.18 ± 0.62
1.18HisLys: 1.18 ± 0.046
2.36HisLeu: 2.36 ± 0.757
0.787HisMet: 0.787 ± 0.191
0.393HisAsn: 0.393 ± 0.237
0.787HisPro: 0.787 ± 0.474
0.393HisGln: 0.393 ± 0.429
1.18HisArg: 1.18 ± 0.046
1.18HisSer: 1.18 ± 0.711
2.754HisThr: 2.754 ± 0.994
1.574HisVal: 1.574 ± 0.949
0.0HisTrp: 0.0 ± 0.0
0.393HisTyr: 0.393 ± 0.237
0.0HisXaa: 0.0 ± 0.0
Ile
4.327IleAla: 4.327 ± 0.054
0.787IleCys: 0.787 ± 0.474
5.901IleAsp: 5.901 ± 0.437
5.901IleGlu: 5.901 ± 0.894
1.574IlePhe: 1.574 ± 0.283
3.934IleGly: 3.934 ± 1.623
3.541IleHis: 3.541 ± 1.469
3.147IleIle: 3.147 ± 0.766
2.36IleLys: 2.36 ± 1.24
3.541IleLeu: 3.541 ± 0.137
0.393IleMet: 0.393 ± 0.237
2.754IleAsn: 2.754 ± 0.994
2.754IlePro: 2.754 ± 1.003
1.574IleGln: 1.574 ± 0.283
2.36IleArg: 2.36 ± 0.091
1.967IleSer: 1.967 ± 0.146
4.327IleThr: 4.327 ± 2.052
1.18IleVal: 1.18 ± 0.046
0.787IleTrp: 0.787 ± 0.191
0.787IleTyr: 0.787 ± 0.474
0.0IleXaa: 0.0 ± 0.0
Lys
3.934LysAla: 3.934 ± 1.04
0.787LysCys: 0.787 ± 0.474
1.574LysAsp: 1.574 ± 0.949
3.147LysGlu: 3.147 ± 1.231
2.36LysPhe: 2.36 ± 0.091
2.36LysGly: 2.36 ± 0.091
1.967LysHis: 1.967 ± 1.186
3.541LysIle: 3.541 ± 0.529
3.147LysLys: 3.147 ± 0.566
4.721LysLeu: 4.721 ± 0.183
0.787LysMet: 0.787 ± 0.474
2.754LysAsn: 2.754 ± 0.329
1.967LysPro: 1.967 ± 0.146
1.967LysGln: 1.967 ± 0.146
1.967LysArg: 1.967 ± 1.186
3.147LysSer: 3.147 ± 1.897
1.967LysThr: 1.967 ± 0.52
3.541LysVal: 3.541 ± 0.137
1.18LysTrp: 1.18 ± 0.711
1.967LysTyr: 1.967 ± 0.52
0.0LysXaa: 0.0 ± 0.0
Leu
8.261LeuAla: 8.261 ± 2.343
1.574LeuCys: 1.574 ± 0.283
3.934LeuAsp: 3.934 ± 0.957
4.721LeuGlu: 4.721 ± 2.18
2.36LeuPhe: 2.36 ± 0.574
5.901LeuGly: 5.901 ± 0.894
1.967LeuHis: 1.967 ± 0.811
2.754LeuIle: 2.754 ± 0.329
3.934LeuLys: 3.934 ± 1.04
7.868LeuLeu: 7.868 ± 0.748
3.147LeuMet: 3.147 ± 0.1
3.541LeuAsn: 3.541 ± 0.137
3.147LeuPro: 3.147 ± 0.566
5.901LeuGln: 5.901 ± 0.229
2.754LeuArg: 2.754 ± 0.337
5.114LeuSer: 5.114 ± 0.246
8.261LeuThr: 8.261 ± 1.012
4.721LeuVal: 4.721 ± 0.849
0.787LeuTrp: 0.787 ± 0.474
1.18LeuTyr: 1.18 ± 0.711
0.0LeuXaa: 0.0 ± 0.0
Met
1.574MetAla: 1.574 ± 0.283
0.0MetCys: 0.0 ± 0.0
1.574MetAsp: 1.574 ± 0.949
0.787MetGlu: 0.787 ± 0.474
1.574MetPhe: 1.574 ± 0.283
2.36MetGly: 2.36 ± 0.574
0.787MetHis: 0.787 ± 0.474
1.574MetIle: 1.574 ± 0.383
0.393MetLys: 0.393 ± 0.237
3.147MetLeu: 3.147 ± 0.566
0.0MetMet: 0.0 ± 0.0
1.18MetAsn: 1.18 ± 0.046
0.393MetPro: 0.393 ± 0.429
0.393MetGln: 0.393 ± 0.237
1.967MetArg: 1.967 ± 0.52
1.18MetSer: 1.18 ± 0.62
2.754MetThr: 2.754 ± 0.337
1.574MetVal: 1.574 ± 0.949
0.393MetTrp: 0.393 ± 0.237
1.18MetTyr: 1.18 ± 0.711
0.0MetXaa: 0.0 ± 0.0
Asn
4.327AsnAla: 4.327 ± 1.386
0.393AsnCys: 0.393 ± 0.237
0.787AsnAsp: 0.787 ± 0.191
1.574AsnGlu: 1.574 ± 0.383
2.36AsnPhe: 2.36 ± 1.24
3.147AsnGly: 3.147 ± 1.432
0.393AsnHis: 0.393 ± 0.429
3.541AsnIle: 3.541 ± 1.194
3.541AsnLys: 3.541 ± 1.194
3.147AsnLeu: 3.147 ± 0.1
1.574AsnMet: 1.574 ± 0.202
3.541AsnAsn: 3.541 ± 0.529
2.754AsnPro: 2.754 ± 0.337
0.787AsnGln: 0.787 ± 0.191
1.18AsnArg: 1.18 ± 0.62
3.541AsnSer: 3.541 ± 0.803
2.754AsnThr: 2.754 ± 1.003
5.901AsnVal: 5.901 ± 2.226
1.18AsnTrp: 1.18 ± 1.286
1.18AsnTyr: 1.18 ± 1.286
0.0AsnXaa: 0.0 ± 0.0
Pro
2.36ProAla: 2.36 ± 1.24
0.0ProCys: 0.0 ± 0.0
2.754ProAsp: 2.754 ± 0.329
1.574ProGlu: 1.574 ± 0.283
2.36ProPhe: 2.36 ± 1.24
2.754ProGly: 2.754 ± 0.337
1.574ProHis: 1.574 ± 0.383
1.967ProIle: 1.967 ± 0.52
1.967ProLys: 1.967 ± 0.146
3.541ProLeu: 3.541 ± 0.137
1.18ProMet: 1.18 ± 0.62
0.787ProAsn: 0.787 ± 0.191
1.18ProPro: 1.18 ± 0.711
1.18ProGln: 1.18 ± 0.046
1.967ProArg: 1.967 ± 0.146
3.934ProSer: 3.934 ± 0.957
5.507ProThr: 5.507 ± 2.672
5.901ProVal: 5.901 ± 0.437
1.574ProTrp: 1.574 ± 0.383
3.541ProTyr: 3.541 ± 1.194
0.0ProXaa: 0.0 ± 0.0
Gln
3.934GlnAla: 3.934 ± 1.04
0.393GlnCys: 0.393 ± 0.237
1.574GlnAsp: 1.574 ± 0.283
2.754GlnGlu: 2.754 ± 0.994
1.967GlnPhe: 1.967 ± 0.52
2.754GlnGly: 2.754 ± 1.003
0.787GlnHis: 0.787 ± 0.474
3.147GlnIle: 3.147 ± 1.432
3.147GlnLys: 3.147 ± 0.566
1.18GlnLeu: 1.18 ± 0.046
1.18GlnMet: 1.18 ± 0.711
0.787GlnAsn: 0.787 ± 0.857
1.967GlnPro: 1.967 ± 1.477
1.18GlnGln: 1.18 ± 0.62
1.574GlnArg: 1.574 ± 0.383
1.574GlnSer: 1.574 ± 0.383
0.787GlnThr: 0.787 ± 0.474
3.147GlnVal: 3.147 ± 0.766
0.0GlnTrp: 0.0 ± 0.0
1.574GlnTyr: 1.574 ± 0.383
0.0GlnXaa: 0.0 ± 0.0
Arg
1.967ArgAla: 1.967 ± 0.52
0.393ArgCys: 0.393 ± 0.429
1.574ArgAsp: 1.574 ± 0.949
0.787ArgGlu: 0.787 ± 0.474
3.147ArgPhe: 3.147 ± 1.231
2.754ArgGly: 2.754 ± 1.669
0.787ArgHis: 0.787 ± 0.474
3.147ArgIle: 3.147 ± 0.566
1.967ArgLys: 1.967 ± 0.52
4.721ArgLeu: 4.721 ± 0.849
1.574ArgMet: 1.574 ± 0.949
1.967ArgAsn: 1.967 ± 0.52
2.36ArgPro: 2.36 ± 0.091
1.18ArgGln: 1.18 ± 0.046
3.934ArgArg: 3.934 ± 0.957
3.934ArgSer: 3.934 ± 1.04
1.574ArgThr: 1.574 ± 0.283
7.081ArgVal: 7.081 ± 1.723
0.787ArgTrp: 0.787 ± 0.191
1.967ArgTyr: 1.967 ± 0.146
0.0ArgXaa: 0.0 ± 0.0
Ser
5.114SerAla: 5.114 ± 2.243
0.787SerCys: 0.787 ± 0.191
2.36SerAsp: 2.36 ± 0.091
2.754SerGlu: 2.754 ± 0.994
5.507SerPhe: 5.507 ± 0.009
1.574SerGly: 1.574 ± 0.383
0.787SerHis: 0.787 ± 0.474
3.147SerIle: 3.147 ± 0.1
2.754SerLys: 2.754 ± 1.66
7.474SerLeu: 7.474 ± 0.154
3.147SerMet: 3.147 ± 0.566
5.507SerAsn: 5.507 ± 1.34
1.967SerPro: 1.967 ± 1.477
3.147SerGln: 3.147 ± 2.763
1.967SerArg: 1.967 ± 0.146
4.327SerSer: 4.327 ± 1.943
5.901SerThr: 5.901 ± 1.103
5.114SerVal: 5.114 ± 0.912
0.393SerTrp: 0.393 ± 0.237
1.967SerTyr: 1.967 ± 0.811
0.0SerXaa: 0.0 ± 0.0
Thr
5.901ThrAla: 5.901 ± 0.437
1.574ThrCys: 1.574 ± 0.283
3.541ThrAsp: 3.541 ± 1.194
1.967ThrGlu: 1.967 ± 0.52
1.574ThrPhe: 1.574 ± 0.949
3.541ThrGly: 3.541 ± 0.137
1.18ThrHis: 1.18 ± 0.62
2.754ThrIle: 2.754 ± 0.337
3.934ThrLys: 3.934 ± 0.374
7.081ThrLeu: 7.081 ± 1.057
1.574ThrMet: 1.574 ± 0.383
3.541ThrAsn: 3.541 ± 2.526
7.081ThrPro: 7.081 ± 2.389
4.327ThrGln: 4.327 ± 0.72
4.721ThrArg: 4.721 ± 0.849
7.081ThrSer: 7.081 ± 3.054
3.934ThrThr: 3.934 ± 0.374
5.507ThrVal: 5.507 ± 0.674
1.18ThrTrp: 1.18 ± 1.286
1.574ThrTyr: 1.574 ± 0.949
0.0ThrXaa: 0.0 ± 0.0
Val
7.474ValAla: 7.474 ± 0.154
1.574ValCys: 1.574 ± 0.949
4.721ValAsp: 4.721 ± 1.149
6.294ValGlu: 6.294 ± 1.797
1.18ValPhe: 1.18 ± 0.711
6.294ValGly: 6.294 ± 0.866
1.967ValHis: 1.967 ± 1.186
4.327ValIle: 4.327 ± 0.611
2.36ValLys: 2.36 ± 1.423
4.327ValLeu: 4.327 ± 0.054
1.18ValMet: 1.18 ± 0.205
3.147ValAsn: 3.147 ± 0.766
3.934ValPro: 3.934 ± 0.957
3.147ValGln: 3.147 ± 0.566
7.081ValArg: 7.081 ± 1.606
4.327ValSer: 4.327 ± 0.72
4.721ValThr: 4.721 ± 1.814
8.655ValVal: 8.655 ± 0.774
1.18ValTrp: 1.18 ± 0.711
5.114ValTyr: 5.114 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
1.18TrpAla: 1.18 ± 0.62
0.787TrpCys: 0.787 ± 0.191
1.574TrpAsp: 1.574 ± 0.949
0.393TrpGlu: 0.393 ± 0.237
1.18TrpPhe: 1.18 ± 0.711
0.787TrpGly: 0.787 ± 0.857
0.393TrpHis: 0.393 ± 0.237
0.393TrpIle: 0.393 ± 0.429
0.393TrpLys: 0.393 ± 0.237
0.787TrpLeu: 0.787 ± 0.191
0.0TrpMet: 0.0 ± 0.0
1.967TrpAsn: 1.967 ± 0.811
0.393TrpPro: 0.393 ± 0.237
0.787TrpGln: 0.787 ± 0.191
0.787TrpArg: 0.787 ± 0.191
1.574TrpSer: 1.574 ± 0.383
1.967TrpThr: 1.967 ± 1.186
0.393TrpVal: 0.393 ± 0.237
0.787TrpTrp: 0.787 ± 0.857
0.787TrpTyr: 0.787 ± 0.474
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.754TyrAla: 2.754 ± 0.337
0.787TyrCys: 0.787 ± 0.474
0.787TyrAsp: 0.787 ± 0.191
1.18TyrGlu: 1.18 ± 0.046
1.574TyrPhe: 1.574 ± 0.283
2.754TyrGly: 2.754 ± 0.329
1.18TyrHis: 1.18 ± 0.62
2.754TyrIle: 2.754 ± 0.994
2.36TyrLys: 2.36 ± 1.423
2.36TyrLeu: 2.36 ± 0.091
2.754TyrMet: 2.754 ± 0.329
2.36TyrAsn: 2.36 ± 0.574
1.18TyrPro: 1.18 ± 0.046
0.0TyrGln: 0.0 ± 0.0
1.574TyrArg: 1.574 ± 0.283
2.36TyrSer: 2.36 ± 1.24
5.507TyrThr: 5.507 ± 0.674
2.754TyrVal: 2.754 ± 0.337
0.787TyrTrp: 0.787 ± 0.474
1.18TyrTyr: 1.18 ± 0.046
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2543 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski