Amino acid dipepetide frequency for Beihai picorna-like virus 119

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.002AlaAla: 4.002 ± 0.061
0.616AlaCys: 0.616 ± 0.274
3.079AlaAsp: 3.079 ± 0.35
4.926AlaGlu: 4.926 ± 0.472
3.387AlaPhe: 3.387 ± 1.073
5.542AlaGly: 5.542 ± 0.114
0.616AlaHis: 0.616 ± 0.274
3.695AlaIle: 3.695 ± 2.656
3.695AlaLys: 3.695 ± 0.784
8.005AlaLeu: 8.005 ± 3.318
1.232AlaMet: 1.232 ± 0.548
2.463AlaAsn: 2.463 ± 0.236
3.079AlaPro: 3.079 ± 0.51
4.31AlaGln: 4.31 ± 4.103
3.079AlaArg: 3.079 ± 0.51
3.079AlaSer: 3.079 ± 0.51
5.234AlaThr: 5.234 ± 1.111
2.771AlaVal: 2.771 ± 0.373
0.616AlaTrp: 0.616 ± 0.274
1.847AlaTyr: 1.847 ± 0.822
0.0AlaXaa: 0.0 ± 0.0
Cys
0.308CysAla: 0.308 ± 0.137
0.0CysCys: 0.0 ± 0.0
1.232CysAsp: 1.232 ± 0.548
0.616CysGlu: 0.616 ± 0.274
0.924CysPhe: 0.924 ± 0.411
1.847CysGly: 1.847 ± 0.822
0.308CysHis: 0.308 ± 0.137
0.924CysIle: 0.924 ± 0.411
0.308CysLys: 0.308 ± 0.137
1.539CysLeu: 1.539 ± 0.685
0.308CysMet: 0.308 ± 0.137
0.308CysAsn: 0.308 ± 0.137
0.616CysPro: 0.616 ± 0.274
0.616CysGln: 0.616 ± 0.274
0.616CysArg: 0.616 ± 0.586
1.539CysSer: 1.539 ± 0.685
2.155CysThr: 2.155 ± 0.959
1.232CysVal: 1.232 ± 0.548
0.308CysTrp: 0.308 ± 0.137
0.308CysTyr: 0.308 ± 0.137
0.0CysXaa: 0.0 ± 0.0
Asp
4.31AspAla: 4.31 ± 2.382
1.539AspCys: 1.539 ± 0.685
7.389AspAsp: 7.389 ± 0.709
4.002AspGlu: 4.002 ± 1.782
5.85AspPhe: 5.85 ± 1.697
3.079AspGly: 3.079 ± 1.21
2.463AspHis: 2.463 ± 0.236
2.463AspIle: 2.463 ± 0.236
4.31AspLys: 4.31 ± 1.059
6.466AspLeu: 6.466 ± 0.298
1.232AspMet: 1.232 ± 0.548
0.616AspAsn: 0.616 ± 0.274
4.926AspPro: 4.926 ± 2.193
3.695AspGln: 3.695 ± 0.076
2.155AspArg: 2.155 ± 0.761
4.926AspSer: 4.926 ± 0.472
2.771AspThr: 2.771 ± 0.487
3.079AspVal: 3.079 ± 2.07
1.232AspTrp: 1.232 ± 0.548
2.463AspTyr: 2.463 ± 0.236
0.0AspXaa: 0.0 ± 0.0
Glu
6.466GluAla: 6.466 ± 1.158
0.616GluCys: 0.616 ± 0.274
5.542GluAsp: 5.542 ± 0.747
7.697GluGlu: 7.697 ± 0.875
1.847GluPhe: 1.847 ± 0.038
2.771GluGly: 2.771 ± 1.347
1.232GluHis: 1.232 ± 0.548
4.31GluIle: 4.31 ± 1.059
5.542GluLys: 5.542 ± 2.467
5.85GluLeu: 5.85 ± 0.884
1.847GluMet: 1.847 ± 0.474
1.847GluAsn: 1.847 ± 0.038
2.155GluPro: 2.155 ± 0.959
3.695GluGln: 3.695 ± 0.936
4.618GluArg: 4.618 ± 1.196
3.695GluSer: 3.695 ± 0.936
1.847GluThr: 1.847 ± 0.038
3.387GluVal: 3.387 ± 0.647
1.232GluTrp: 1.232 ± 0.548
1.539GluTyr: 1.539 ± 0.175
0.0GluXaa: 0.0 ± 0.0
Phe
2.771PheAla: 2.771 ± 0.487
0.616PheCys: 0.616 ± 0.274
3.695PheAsp: 3.695 ± 0.784
4.002PheGlu: 4.002 ± 0.799
1.539PhePhe: 1.539 ± 1.035
1.847PheGly: 1.847 ± 0.822
0.924PheHis: 0.924 ± 0.411
1.847PheIle: 1.847 ± 0.822
2.463PheLys: 2.463 ± 0.236
4.926PheLeu: 4.926 ± 0.472
0.924PheMet: 0.924 ± 0.411
1.232PheAsn: 1.232 ± 1.172
1.232PhePro: 1.232 ± 0.312
3.387PheGln: 3.387 ± 0.213
2.155PheArg: 2.155 ± 0.099
2.463PheSer: 2.463 ± 1.484
4.002PheThr: 4.002 ± 2.519
2.463PheVal: 2.463 ± 0.624
1.539PheTrp: 1.539 ± 0.685
1.539PheTyr: 1.539 ± 0.685
0.0PheXaa: 0.0 ± 0.0
Gly
6.773GlyAla: 6.773 ± 3.866
1.232GlyCys: 1.232 ± 0.548
4.618GlyAsp: 4.618 ± 0.335
2.463GlyGlu: 2.463 ± 1.484
3.079GlyPhe: 3.079 ± 0.51
4.002GlyGly: 4.002 ± 0.799
0.924GlyHis: 0.924 ± 1.309
4.002GlyIle: 4.002 ± 0.922
5.542GlyLys: 5.542 ± 0.747
3.695GlyLeu: 3.695 ± 0.936
1.232GlyMet: 1.232 ± 0.548
2.463GlyAsn: 2.463 ± 0.236
2.463GlyPro: 2.463 ± 0.236
1.847GlyGln: 1.847 ± 0.822
4.002GlyArg: 4.002 ± 0.799
2.463GlySer: 2.463 ± 0.624
2.463GlyThr: 2.463 ± 0.624
4.002GlyVal: 4.002 ± 0.799
0.616GlyTrp: 0.616 ± 0.274
1.847GlyTyr: 1.847 ± 0.038
0.0GlyXaa: 0.0 ± 0.0
His
1.539HisAla: 1.539 ± 0.685
0.0HisCys: 0.0 ± 0.0
0.924HisAsp: 0.924 ± 0.411
1.232HisGlu: 1.232 ± 0.312
1.847HisPhe: 1.847 ± 0.038
2.155HisGly: 2.155 ± 1.621
1.232HisHis: 1.232 ± 0.548
1.847HisIle: 1.847 ± 0.822
1.232HisLys: 1.232 ± 1.172
2.771HisLeu: 2.771 ± 1.234
0.616HisMet: 0.616 ± 0.274
0.308HisAsn: 0.308 ± 0.723
2.463HisPro: 2.463 ± 2.344
1.539HisGln: 1.539 ± 1.035
0.924HisArg: 0.924 ± 0.411
1.539HisSer: 1.539 ± 0.175
0.924HisThr: 0.924 ± 0.411
1.847HisVal: 1.847 ± 0.822
0.308HisTrp: 0.308 ± 0.137
1.232HisTyr: 1.232 ± 0.548
0.0HisXaa: 0.0 ± 0.0
Ile
5.542IleAla: 5.542 ± 0.747
0.924IleCys: 0.924 ± 0.411
4.002IleAsp: 4.002 ± 1.659
2.463IleGlu: 2.463 ± 0.624
1.847IlePhe: 1.847 ± 0.038
1.847IleGly: 1.847 ± 0.822
2.771IleHis: 2.771 ± 0.373
0.924IleIle: 0.924 ± 0.411
3.079IleLys: 3.079 ± 0.51
3.079IleLeu: 3.079 ± 0.51
0.924IleMet: 0.924 ± 0.411
4.002IleAsn: 4.002 ± 0.799
4.618IlePro: 4.618 ± 0.525
2.771IleGln: 2.771 ± 1.234
4.002IleArg: 4.002 ± 1.659
2.155IleSer: 2.155 ± 0.761
3.387IleThr: 3.387 ± 1.508
3.079IleVal: 3.079 ± 0.51
0.616IleTrp: 0.616 ± 0.274
2.463IleTyr: 2.463 ± 0.236
0.0IleXaa: 0.0 ± 0.0
Lys
4.618LysAla: 4.618 ± 1.196
0.0LysCys: 0.0 ± 0.0
3.695LysAsp: 3.695 ± 0.784
4.926LysGlu: 4.926 ± 0.472
1.847LysPhe: 1.847 ± 0.822
3.079LysGly: 3.079 ± 0.51
1.847LysHis: 1.847 ± 0.038
4.618LysIle: 4.618 ± 1.196
5.85LysLys: 5.85 ± 2.604
4.618LysLeu: 4.618 ± 0.525
2.463LysMet: 2.463 ± 0.236
1.539LysAsn: 1.539 ± 0.175
3.695LysPro: 3.695 ± 0.076
3.695LysGln: 3.695 ± 0.784
3.695LysArg: 3.695 ± 1.645
2.463LysSer: 2.463 ± 1.096
4.618LysThr: 4.618 ± 2.056
2.463LysVal: 2.463 ± 0.236
1.847LysTrp: 1.847 ± 0.038
0.924LysTyr: 0.924 ± 0.411
0.0LysXaa: 0.0 ± 0.0
Leu
4.31LeuAla: 4.31 ± 1.522
1.232LeuCys: 1.232 ± 0.548
2.771LeuAsp: 2.771 ± 0.487
8.313LeuGlu: 8.313 ± 1.98
2.155LeuPhe: 2.155 ± 0.959
6.158LeuGly: 6.158 ± 2.741
3.387LeuHis: 3.387 ± 2.793
3.079LeuIle: 3.079 ± 0.51
6.158LeuLys: 6.158 ± 0.16
7.389LeuLeu: 7.389 ± 2.429
1.539LeuMet: 1.539 ± 0.175
3.695LeuAsn: 3.695 ± 0.076
4.31LeuPro: 4.31 ± 0.662
4.002LeuGln: 4.002 ± 1.782
5.234LeuArg: 5.234 ± 0.251
5.234LeuSer: 5.234 ± 0.251
4.926LeuThr: 4.926 ± 1.248
4.31LeuVal: 4.31 ± 0.662
1.232LeuTrp: 1.232 ± 1.172
1.847LeuTyr: 1.847 ± 0.822
0.0LeuXaa: 0.0 ± 0.0
Met
1.232MetAla: 1.232 ± 0.312
0.616MetCys: 0.616 ± 0.274
1.232MetAsp: 1.232 ± 0.548
2.155MetGlu: 2.155 ± 0.959
0.308MetPhe: 0.308 ± 0.137
0.616MetGly: 0.616 ± 0.274
0.616MetHis: 0.616 ± 0.274
1.232MetIle: 1.232 ± 0.312
1.232MetLys: 1.232 ± 0.548
2.463MetLeu: 2.463 ± 1.096
0.924MetMet: 0.924 ± 0.449
0.616MetAsn: 0.616 ± 0.274
1.847MetPro: 1.847 ± 0.898
1.232MetGln: 1.232 ± 0.312
1.232MetArg: 1.232 ± 0.548
0.308MetSer: 0.308 ± 0.137
1.232MetThr: 1.232 ± 0.312
1.847MetVal: 1.847 ± 1.758
0.616MetTrp: 0.616 ± 0.274
0.924MetTyr: 0.924 ± 0.449
0.0MetXaa: 0.0 ± 0.0
Asn
1.232AsnAla: 1.232 ± 0.312
1.232AsnCys: 1.232 ± 0.548
1.539AsnAsp: 1.539 ± 1.035
2.771AsnGlu: 2.771 ± 0.373
2.771AsnPhe: 2.771 ± 0.487
1.847AsnGly: 1.847 ± 0.898
0.924AsnHis: 0.924 ± 1.309
1.539AsnIle: 1.539 ± 0.685
2.155AsnLys: 2.155 ± 0.959
0.924AsnLeu: 0.924 ± 0.449
0.924AsnMet: 0.924 ± 0.449
2.155AsnAsn: 2.155 ± 0.959
3.387AsnPro: 3.387 ± 0.213
1.232AsnGln: 1.232 ± 0.312
3.079AsnArg: 3.079 ± 1.371
1.847AsnSer: 1.847 ± 0.038
3.695AsnThr: 3.695 ± 0.936
3.079AsnVal: 3.079 ± 0.35
1.539AsnTrp: 1.539 ± 0.175
0.308AsnTyr: 0.308 ± 0.137
0.0AsnXaa: 0.0 ± 0.0
Pro
2.155ProAla: 2.155 ± 0.959
0.924ProCys: 0.924 ± 0.411
4.31ProAsp: 4.31 ± 1.522
4.926ProGlu: 4.926 ± 1.333
3.079ProPhe: 3.079 ± 2.07
2.771ProGly: 2.771 ± 1.347
1.232ProHis: 1.232 ± 0.548
3.387ProIle: 3.387 ± 0.213
4.618ProLys: 4.618 ± 1.196
3.695ProLeu: 3.695 ± 0.784
1.539ProMet: 1.539 ± 1.035
2.463ProAsn: 2.463 ± 0.236
3.695ProPro: 3.695 ± 0.936
3.387ProGln: 3.387 ± 0.647
2.463ProArg: 2.463 ± 0.236
3.695ProSer: 3.695 ± 0.076
2.771ProThr: 2.771 ± 0.373
3.387ProVal: 3.387 ± 1.933
1.232ProTrp: 1.232 ± 0.312
0.924ProTyr: 0.924 ± 0.411
0.0ProXaa: 0.0 ± 0.0
Gln
3.695GlnAla: 3.695 ± 0.936
1.539GlnCys: 1.539 ± 0.175
3.695GlnAsp: 3.695 ± 0.784
2.771GlnGlu: 2.771 ± 1.234
1.539GlnPhe: 1.539 ± 0.685
3.695GlnGly: 3.695 ± 0.784
2.463GlnHis: 2.463 ± 1.096
3.387GlnIle: 3.387 ± 0.647
2.463GlnLys: 2.463 ± 0.236
1.847GlnLeu: 1.847 ± 1.758
0.924GlnMet: 0.924 ± 0.411
1.847GlnAsn: 1.847 ± 0.038
2.771GlnPro: 2.771 ± 0.373
3.387GlnGln: 3.387 ± 1.508
3.695GlnArg: 3.695 ± 0.076
2.155GlnSer: 2.155 ± 1.621
2.463GlnThr: 2.463 ± 0.236
3.695GlnVal: 3.695 ± 0.076
0.0GlnTrp: 0.0 ± 0.0
1.847GlnTyr: 1.847 ± 0.898
0.0GlnXaa: 0.0 ± 0.0
Arg
3.695ArgAla: 3.695 ± 0.076
1.232ArgCys: 1.232 ± 0.548
4.618ArgAsp: 4.618 ± 0.335
2.771ArgGlu: 2.771 ± 1.234
1.847ArgPhe: 1.847 ± 0.038
3.079ArgGly: 3.079 ± 0.51
1.847ArgHis: 1.847 ± 0.038
2.463ArgIle: 2.463 ± 0.624
1.847ArgLys: 1.847 ± 0.822
4.618ArgLeu: 4.618 ± 0.335
1.539ArgMet: 1.539 ± 0.627
2.463ArgAsn: 2.463 ± 1.096
2.771ArgPro: 2.771 ± 0.373
2.155ArgGln: 2.155 ± 0.099
4.002ArgArg: 4.002 ± 1.782
4.926ArgSer: 4.926 ± 0.388
4.002ArgThr: 4.002 ± 0.922
5.234ArgVal: 5.234 ± 2.831
1.232ArgTrp: 1.232 ± 0.548
0.924ArgTyr: 0.924 ± 0.411
0.0ArgXaa: 0.0 ± 0.0
Ser
2.155SerAla: 2.155 ± 0.099
1.539SerCys: 1.539 ± 0.685
5.542SerAsp: 5.542 ± 0.974
3.695SerGlu: 3.695 ± 0.936
3.695SerPhe: 3.695 ± 0.936
2.463SerGly: 2.463 ± 0.624
0.616SerHis: 0.616 ± 0.274
3.695SerIle: 3.695 ± 0.076
2.155SerLys: 2.155 ± 0.099
4.926SerLeu: 4.926 ± 1.248
0.924SerMet: 0.924 ± 1.309
1.847SerAsn: 1.847 ± 1.758
4.926SerPro: 4.926 ± 0.472
1.539SerGln: 1.539 ± 0.685
2.463SerArg: 2.463 ± 1.096
3.387SerSer: 3.387 ± 0.213
4.926SerThr: 4.926 ± 0.472
5.234SerVal: 5.234 ± 0.251
0.924SerTrp: 0.924 ± 0.449
1.847SerTyr: 1.847 ± 0.898
0.0SerXaa: 0.0 ± 0.0
Thr
3.387ThrAla: 3.387 ± 1.073
0.616ThrCys: 0.616 ± 0.274
3.695ThrAsp: 3.695 ± 0.076
2.771ThrGlu: 2.771 ± 0.373
2.155ThrPhe: 2.155 ± 0.099
6.773ThrGly: 6.773 ± 2.146
1.847ThrHis: 1.847 ± 0.038
5.234ThrIle: 5.234 ± 1.111
3.695ThrLys: 3.695 ± 0.784
6.466ThrLeu: 6.466 ± 2.018
0.924ThrMet: 0.924 ± 0.411
2.463ThrAsn: 2.463 ± 0.624
3.079ThrPro: 3.079 ± 0.51
2.771ThrGln: 2.771 ± 1.234
3.387ThrArg: 3.387 ± 1.508
4.31ThrSer: 4.31 ± 0.662
5.234ThrThr: 5.234 ± 1.111
3.079ThrVal: 3.079 ± 2.07
2.155ThrTrp: 2.155 ± 0.761
0.308ThrTyr: 0.308 ± 0.137
0.0ThrXaa: 0.0 ± 0.0
Val
5.234ValAla: 5.234 ± 1.111
1.232ValCys: 1.232 ± 0.548
3.079ValAsp: 3.079 ± 0.35
3.695ValGlu: 3.695 ± 0.784
3.695ValPhe: 3.695 ± 0.784
3.079ValGly: 3.079 ± 2.07
0.308ValHis: 0.308 ± 0.137
2.463ValIle: 2.463 ± 0.624
2.771ValLys: 2.771 ± 0.487
4.002ValLeu: 4.002 ± 0.799
2.155ValMet: 2.155 ± 0.761
2.463ValAsn: 2.463 ± 0.624
2.771ValPro: 2.771 ± 0.487
3.079ValGln: 3.079 ± 0.35
3.387ValArg: 3.387 ± 0.213
5.542ValSer: 5.542 ± 1.834
3.695ValThr: 3.695 ± 0.936
4.31ValVal: 4.31 ± 2.382
0.924ValTrp: 0.924 ± 0.449
1.539ValTyr: 1.539 ± 1.035
0.0ValXaa: 0.0 ± 0.0
Trp
0.616TrpAla: 0.616 ± 0.274
0.0TrpCys: 0.0 ± 0.0
1.847TrpAsp: 1.847 ± 0.038
0.924TrpGlu: 0.924 ± 0.411
0.616TrpPhe: 0.616 ± 0.274
0.924TrpGly: 0.924 ± 0.449
0.308TrpHis: 0.308 ± 0.723
2.463TrpIle: 2.463 ± 0.624
2.155TrpLys: 2.155 ± 0.959
1.232TrpLeu: 1.232 ± 0.548
0.0TrpMet: 0.0 ± 0.0
2.155TrpAsn: 2.155 ± 0.099
0.616TrpPro: 0.616 ± 1.446
0.0TrpGln: 0.0 ± 0.0
1.232TrpArg: 1.232 ± 0.548
1.539TrpSer: 1.539 ± 0.685
2.155TrpThr: 2.155 ± 0.761
0.308TrpVal: 0.308 ± 0.137
0.308TrpTrp: 0.308 ± 0.137
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.232TyrAla: 1.232 ± 0.312
0.308TyrCys: 0.308 ± 0.137
2.463TyrAsp: 2.463 ± 1.096
0.924TyrGlu: 0.924 ± 0.449
1.539TyrPhe: 1.539 ± 0.175
2.155TyrGly: 2.155 ± 0.761
0.616TyrHis: 0.616 ± 0.274
0.924TyrIle: 0.924 ± 0.411
1.232TyrLys: 1.232 ± 0.548
2.771TyrLeu: 2.771 ± 0.373
0.0TyrMet: 0.0 ± 0.0
1.232TyrAsn: 1.232 ± 0.548
1.232TyrPro: 1.232 ± 0.312
1.539TyrGln: 1.539 ± 0.685
1.847TyrArg: 1.847 ± 0.898
1.232TyrSer: 1.232 ± 0.312
1.847TyrThr: 1.847 ± 0.822
0.616TyrVal: 0.616 ± 0.274
0.924TyrTrp: 0.924 ± 0.449
1.232TyrTyr: 1.232 ± 0.548
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3249 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski