Amino acid dipepetide frequency for Duck hepatitis B virus (strain Germany/DHBV-3) (DHBV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.166AlaAla: 2.166 ± 0.646
1.083AlaCys: 1.083 ± 0.522
0.541AlaAsp: 0.541 ± 0.357
2.707AlaGlu: 2.707 ± 1.408
3.249AlaPhe: 3.249 ± 1.104
7.038AlaGly: 7.038 ± 3.504
1.083AlaHis: 1.083 ± 0.802
2.707AlaIle: 2.707 ± 0.844
3.79AlaLys: 3.79 ± 1.043
7.038AlaLeu: 7.038 ± 2.93
0.541AlaMet: 0.541 ± 0.357
2.707AlaAsn: 2.707 ± 1.408
3.79AlaPro: 3.79 ± 1.043
2.166AlaGln: 2.166 ± 1.092
4.873AlaArg: 4.873 ± 0.88
4.331AlaSer: 4.331 ± 0.441
5.956AlaThr: 5.956 ± 0.879
4.331AlaVal: 4.331 ± 1.043
0.541AlaTrp: 0.541 ± 0.546
1.624AlaTyr: 1.624 ± 0.641
0.0AlaXaa: 0.0 ± 0.0
Cys
2.166CysAla: 2.166 ± 0.728
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.541CysGly: 0.541 ± 0.357
1.083CysHis: 1.083 ± 0.713
0.541CysIle: 0.541 ± 0.357
0.541CysLys: 0.541 ± 0.357
2.707CysLeu: 2.707 ± 1.02
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.707CysPro: 2.707 ± 1.614
0.541CysGln: 0.541 ± 0.547
0.541CysArg: 0.541 ± 0.357
0.541CysSer: 0.541 ± 0.357
1.083CysThr: 1.083 ± 0.522
0.541CysVal: 0.541 ± 0.547
0.541CysTrp: 0.541 ± 0.357
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.79AspAla: 3.79 ± 2.2
0.0AspCys: 0.0 ± 0.0
3.249AspAsp: 3.249 ± 1.276
1.624AspGlu: 1.624 ± 0.641
3.249AspPhe: 3.249 ± 1.283
2.166AspGly: 2.166 ± 1.427
1.624AspHis: 1.624 ± 0.956
2.707AspIle: 2.707 ± 0.844
2.166AspLys: 2.166 ± 1.427
5.956AspLeu: 5.956 ± 2.67
0.541AspMet: 0.541 ± 0.357
1.624AspAsn: 1.624 ± 0.74
1.624AspPro: 1.624 ± 1.372
1.083AspGln: 1.083 ± 0.472
1.083AspArg: 1.083 ± 0.802
5.414AspSer: 5.414 ± 2.049
2.166AspThr: 2.166 ± 0.728
1.083AspVal: 1.083 ± 0.472
2.166AspTrp: 2.166 ± 0.728
1.083AspTyr: 1.083 ± 0.802
0.0AspXaa: 0.0 ± 0.0
Glu
7.038GluAla: 7.038 ± 2.274
0.541GluCys: 0.541 ± 0.357
2.166GluAsp: 2.166 ± 0.531
7.038GluGlu: 7.038 ± 1.321
0.0GluPhe: 0.0 ± 0.0
0.541GluGly: 0.541 ± 0.546
0.541GluHis: 0.541 ± 0.357
5.414GluIle: 5.414 ± 1.462
1.083GluLys: 1.083 ± 0.713
2.166GluLeu: 2.166 ± 0.913
0.541GluMet: 0.541 ± 0.357
1.624GluAsn: 1.624 ± 1.07
3.249GluPro: 3.249 ± 1.276
0.541GluGln: 0.541 ± 0.357
3.79GluArg: 3.79 ± 2.326
2.707GluSer: 2.707 ± 0.844
1.624GluThr: 1.624 ± 0.956
0.541GluVal: 0.541 ± 0.357
1.083GluTrp: 1.083 ± 0.713
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.083PheAla: 1.083 ± 0.713
0.0PheCys: 0.0 ± 0.0
1.083PheAsp: 1.083 ± 0.713
0.0PheGlu: 0.0 ± 0.0
2.166PhePhe: 2.166 ± 1.092
2.707PheGly: 2.707 ± 1.74
0.0PheHis: 0.0 ± 0.0
1.083PheIle: 1.083 ± 0.905
1.624PheLys: 1.624 ± 0.641
5.956PheLeu: 5.956 ± 2.136
0.541PheMet: 0.541 ± 0.357
1.083PheAsn: 1.083 ± 0.713
3.249PhePro: 3.249 ± 0.419
2.166PheGln: 2.166 ± 1.092
1.083PheArg: 1.083 ± 0.472
4.331PheSer: 4.331 ± 1.733
2.166PheThr: 2.166 ± 0.991
3.249PheVal: 3.249 ± 1.115
1.624PheTrp: 1.624 ± 0.641
1.624PheTyr: 1.624 ± 0.74
0.0PheXaa: 0.0 ± 0.0
Gly
3.79GlyAla: 3.79 ± 1.383
1.624GlyCys: 1.624 ± 0.74
2.707GlyAsp: 2.707 ± 0.319
3.79GlyGlu: 3.79 ± 0.71
1.624GlyPhe: 1.624 ± 0.74
3.79GlyGly: 3.79 ± 0.785
0.541GlyHis: 0.541 ± 0.357
2.707GlyIle: 2.707 ± 0.495
5.414GlyLys: 5.414 ± 1.81
7.038GlyLeu: 7.038 ± 3.464
1.624GlyMet: 1.624 ± 0.641
2.707GlyAsn: 2.707 ± 1.233
1.083GlyPro: 1.083 ± 0.713
2.166GlyGln: 2.166 ± 0.913
5.414GlyArg: 5.414 ± 1.783
5.414GlySer: 5.414 ± 0.886
3.79GlyThr: 3.79 ± 0.785
1.624GlyVal: 1.624 ± 1.07
0.0GlyTrp: 0.0 ± 0.0
1.624GlyTyr: 1.624 ± 0.641
0.0GlyXaa: 0.0 ± 0.0
His
1.624HisAla: 1.624 ± 0.641
0.0HisCys: 0.0 ± 0.0
1.083HisAsp: 1.083 ± 0.802
1.624HisGlu: 1.624 ± 0.641
2.707HisPhe: 2.707 ± 0.844
0.541HisGly: 0.541 ± 0.357
2.166HisHis: 2.166 ± 0.659
1.083HisIle: 1.083 ± 0.713
0.541HisLys: 0.541 ± 0.357
4.873HisLeu: 4.873 ± 2.089
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
2.707HisPro: 2.707 ± 1.058
1.624HisGln: 1.624 ± 0.641
2.166HisArg: 2.166 ± 0.659
0.541HisSer: 0.541 ± 0.357
0.0HisThr: 0.0 ± 0.0
2.707HisVal: 2.707 ± 0.866
0.0HisTrp: 0.0 ± 0.0
2.166HisTyr: 2.166 ± 1.092
0.0HisXaa: 0.0 ± 0.0
Ile
2.166IleAla: 2.166 ± 1.604
0.0IleCys: 0.0 ± 0.0
2.166IleAsp: 2.166 ± 0.531
3.79IleGlu: 3.79 ± 1.043
1.624IlePhe: 1.624 ± 0.919
1.083IleGly: 1.083 ± 0.905
1.624IleHis: 1.624 ± 0.641
1.624IleIle: 1.624 ± 0.74
5.414IleLys: 5.414 ± 0.636
6.497IleLeu: 6.497 ± 4.123
0.541IleMet: 0.541 ± 0.357
4.873IleAsn: 4.873 ± 0.88
5.414IlePro: 5.414 ± 1.037
3.79IleGln: 3.79 ± 1.251
1.624IleArg: 1.624 ± 1.07
8.663IleSer: 8.663 ± 1.7
4.873IleThr: 4.873 ± 1.628
2.166IleVal: 2.166 ± 1.083
1.083IleTrp: 1.083 ± 0.905
0.541IleTyr: 0.541 ± 0.357
0.0IleXaa: 0.0 ± 0.0
Lys
3.249LysAla: 3.249 ± 0.419
0.541LysCys: 0.541 ± 0.357
1.624LysAsp: 1.624 ± 0.641
1.083LysGlu: 1.083 ± 0.713
1.083LysPhe: 1.083 ± 0.472
4.873LysGly: 4.873 ± 1.687
3.249LysHis: 3.249 ± 1.283
4.873LysIle: 4.873 ± 0.803
2.707LysLys: 2.707 ± 0.866
4.873LysLeu: 4.873 ± 1.687
2.707LysMet: 2.707 ± 1.413
1.083LysAsn: 1.083 ± 0.713
3.79LysPro: 3.79 ± 1.425
1.624LysGln: 1.624 ± 1.07
1.624LysArg: 1.624 ± 1.07
5.956LysSer: 5.956 ± 1.255
3.249LysThr: 3.249 ± 1.276
2.707LysVal: 2.707 ± 0.844
0.541LysTrp: 0.541 ± 0.357
2.166LysTyr: 2.166 ± 1.427
0.0LysXaa: 0.0 ± 0.0
Leu
5.956LeuAla: 5.956 ± 1.255
1.083LeuCys: 1.083 ± 0.713
3.79LeuAsp: 3.79 ± 0.785
5.414LeuGlu: 5.414 ± 0.806
5.414LeuPhe: 5.414 ± 2.145
5.956LeuGly: 5.956 ± 0.917
1.083LeuHis: 1.083 ± 0.713
7.58LeuIle: 7.58 ± 3.957
3.79LeuLys: 3.79 ± 0.71
15.16LeuLeu: 15.16 ± 5.358
0.541LeuMet: 0.541 ± 0.357
2.166LeuAsn: 2.166 ± 0.913
7.58LeuPro: 7.58 ± 1.434
2.707LeuGln: 2.707 ± 1.693
8.121LeuArg: 8.121 ± 0.55
10.287LeuSer: 10.287 ± 1.303
5.414LeuThr: 5.414 ± 2.628
8.663LeuVal: 8.663 ± 2.261
3.249LeuTrp: 3.249 ± 1.689
4.873LeuTyr: 4.873 ± 1.086
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
2.166MetAsp: 2.166 ± 0.531
0.541MetGlu: 0.541 ± 0.546
0.0MetPhe: 0.0 ± 0.0
2.166MetGly: 2.166 ± 0.913
1.083MetHis: 1.083 ± 0.802
1.083MetIle: 1.083 ± 0.472
1.083MetLys: 1.083 ± 0.713
0.541MetLeu: 0.541 ± 0.357
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.083MetPro: 1.083 ± 0.713
1.624MetGln: 1.624 ± 0.74
0.541MetArg: 0.541 ± 0.357
1.083MetSer: 1.083 ± 0.905
1.624MetThr: 1.624 ± 0.74
0.541MetVal: 0.541 ± 0.357
0.541MetTrp: 0.541 ± 0.547
0.541MetTyr: 0.541 ± 0.357
0.0MetXaa: 0.0 ± 0.0
Asn
2.166AsnAla: 2.166 ± 0.659
1.083AsnCys: 1.083 ± 0.802
1.083AsnAsp: 1.083 ± 0.713
1.624AsnGlu: 1.624 ± 0.641
1.624AsnPhe: 1.624 ± 1.07
2.166AsnGly: 2.166 ± 0.728
0.0AsnHis: 0.0 ± 0.0
0.541AsnIle: 0.541 ± 0.357
0.541AsnLys: 0.541 ± 0.357
2.166AsnLeu: 2.166 ± 0.991
0.0AsnMet: 0.0 ± 0.0
1.083AsnAsn: 1.083 ± 0.713
2.707AsnPro: 2.707 ± 1.233
2.166AsnGln: 2.166 ± 0.943
1.624AsnArg: 1.624 ± 1.07
1.624AsnSer: 1.624 ± 1.07
1.624AsnThr: 1.624 ± 0.633
4.331AsnVal: 4.331 ± 1.318
1.083AsnTrp: 1.083 ± 0.713
1.083AsnTyr: 1.083 ± 0.802
0.0AsnXaa: 0.0 ± 0.0
Pro
4.873ProAla: 4.873 ± 1.973
0.541ProCys: 0.541 ± 0.357
4.873ProAsp: 4.873 ± 1.471
2.707ProGlu: 2.707 ± 0.866
1.083ProPhe: 1.083 ± 0.713
2.707ProGly: 2.707 ± 1.089
1.624ProHis: 1.624 ± 0.74
3.249ProIle: 3.249 ± 1.283
5.956ProLys: 5.956 ± 1.255
8.121ProLeu: 8.121 ± 1.267
1.083ProMet: 1.083 ± 0.713
2.166ProAsn: 2.166 ± 1.427
3.79ProPro: 3.79 ± 1.108
4.873ProGln: 4.873 ± 1.329
8.121ProArg: 8.121 ± 2.545
4.331ProSer: 4.331 ± 0.441
5.414ProThr: 5.414 ± 2.212
3.249ProVal: 3.249 ± 0.419
1.624ProTrp: 1.624 ± 0.74
2.166ProTyr: 2.166 ± 0.659
0.0ProXaa: 0.0 ± 0.0
Gln
1.083GlnAla: 1.083 ± 0.713
1.083GlnCys: 1.083 ± 0.905
1.083GlnAsp: 1.083 ± 0.905
3.79GlnGlu: 3.79 ± 0.487
0.541GlnPhe: 0.541 ± 0.357
4.331GlnGly: 4.331 ± 2.727
3.249GlnHis: 3.249 ± 0.419
2.166GlnIle: 2.166 ± 0.728
0.541GlnLys: 0.541 ± 0.546
3.79GlnLeu: 3.79 ± 1.228
0.541GlnMet: 0.541 ± 0.357
0.541GlnAsn: 0.541 ± 0.357
3.249GlnPro: 3.249 ± 0.605
1.624GlnGln: 1.624 ± 0.956
1.624GlnArg: 1.624 ± 0.641
2.166GlnSer: 2.166 ± 0.913
3.79GlnThr: 3.79 ± 1.559
2.166GlnVal: 2.166 ± 0.659
2.707GlnTrp: 2.707 ± 1.693
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
6.497ArgAla: 6.497 ± 3.605
0.541ArgCys: 0.541 ± 0.357
3.79ArgAsp: 3.79 ± 1.251
2.707ArgGlu: 2.707 ± 0.319
2.166ArgPhe: 2.166 ± 1.427
3.249ArgGly: 3.249 ± 1.283
1.624ArgHis: 1.624 ± 1.07
7.58ArgIle: 7.58 ± 1.836
5.414ArgLys: 5.414 ± 2.816
7.038ArgLeu: 7.038 ± 0.605
0.541ArgMet: 0.541 ± 0.546
1.083ArgAsn: 1.083 ± 0.713
2.707ArgPro: 2.707 ± 0.319
1.083ArgGln: 1.083 ± 0.713
14.618ArgArg: 14.618 ± 2.095
5.956ArgSer: 5.956 ± 2.67
2.707ArgThr: 2.707 ± 0.844
2.166ArgVal: 2.166 ± 0.913
0.541ArgTrp: 0.541 ± 0.357
2.707ArgTyr: 2.707 ± 0.319
0.0ArgXaa: 0.0 ± 0.0
Ser
4.873SerAla: 4.873 ± 2.219
2.707SerCys: 2.707 ± 0.942
5.414SerAsp: 5.414 ± 1.106
0.541SerGlu: 0.541 ± 0.357
3.79SerPhe: 3.79 ± 0.718
3.79SerGly: 3.79 ± 0.785
2.707SerHis: 2.707 ± 0.844
4.873SerIle: 4.873 ± 0.762
5.414SerLys: 5.414 ± 2.432
7.58SerLeu: 7.58 ± 3.236
1.624SerMet: 1.624 ± 1.005
2.166SerAsn: 2.166 ± 0.913
8.663SerPro: 8.663 ± 1.177
1.083SerGln: 1.083 ± 0.713
8.663SerArg: 8.663 ± 2.783
14.077SerSer: 14.077 ± 2.05
5.414SerThr: 5.414 ± 0.67
2.707SerVal: 2.707 ± 0.875
0.541SerTrp: 0.541 ± 0.357
0.541SerTyr: 0.541 ± 0.357
0.0SerXaa: 0.0 ± 0.0
Thr
2.166ThrAla: 2.166 ± 0.659
1.083ThrCys: 1.083 ± 0.713
2.166ThrAsp: 2.166 ± 0.728
1.624ThrGlu: 1.624 ± 0.633
4.331ThrPhe: 4.331 ± 1.455
3.249ThrGly: 3.249 ± 1.121
2.707ThrHis: 2.707 ± 0.319
3.79ThrIle: 3.79 ± 1.966
1.083ThrLys: 1.083 ± 0.713
7.58ThrLeu: 7.58 ± 2.602
1.624ThrMet: 1.624 ± 0.606
1.624ThrAsn: 1.624 ± 0.641
8.121ThrPro: 8.121 ± 1.514
3.79ThrGln: 3.79 ± 2.233
3.249ThrArg: 3.249 ± 1.283
4.873ThrSer: 4.873 ± 0.588
9.746ThrThr: 9.746 ± 2.53
2.166ThrVal: 2.166 ± 1.083
1.624ThrTrp: 1.624 ± 0.847
1.624ThrTyr: 1.624 ± 0.74
0.0ThrXaa: 0.0 ± 0.0
Val
3.79ValAla: 3.79 ± 1.533
2.166ValCys: 2.166 ± 0.991
3.249ValAsp: 3.249 ± 1.115
0.541ValGlu: 0.541 ± 0.357
0.541ValPhe: 0.541 ± 0.357
3.79ValGly: 3.79 ± 1.913
0.0ValHis: 0.0 ± 0.0
3.249ValIle: 3.249 ± 0.419
1.624ValLys: 1.624 ± 0.641
3.249ValLeu: 3.249 ± 0.419
0.541ValMet: 0.541 ± 0.357
1.083ValAsn: 1.083 ± 0.713
3.79ValPro: 3.79 ± 0.711
1.624ValGln: 1.624 ± 0.633
3.249ValArg: 3.249 ± 0.419
4.331ValSer: 4.331 ± 1.255
4.331ValThr: 4.331 ± 0.489
2.166ValVal: 2.166 ± 0.659
1.083ValTrp: 1.083 ± 0.713
3.249ValTyr: 3.249 ± 1.832
0.0ValXaa: 0.0 ± 0.0
Trp
1.083TrpAla: 1.083 ± 0.905
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.541TrpGlu: 0.541 ± 0.357
0.0TrpPhe: 0.0 ± 0.0
2.707TrpGly: 2.707 ± 0.749
1.624TrpHis: 1.624 ± 0.641
1.083TrpIle: 1.083 ± 0.905
2.707TrpLys: 2.707 ± 1.233
1.624TrpLeu: 1.624 ± 0.641
1.083TrpMet: 1.083 ± 0.905
0.541TrpAsn: 0.541 ± 0.547
1.624TrpPro: 1.624 ± 0.633
1.083TrpGln: 1.083 ± 0.802
1.083TrpArg: 1.083 ± 0.802
1.083TrpSer: 1.083 ± 0.472
2.707TrpThr: 2.707 ± 1.089
0.541TrpVal: 0.541 ± 0.357
3.249TrpTrp: 3.249 ± 1.839
1.083TrpTyr: 1.083 ± 0.713
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.707TyrAla: 2.707 ± 0.749
0.0TyrCys: 0.0 ± 0.0
2.166TyrAsp: 2.166 ± 1.083
0.541TyrGlu: 0.541 ± 0.357
1.624TyrPhe: 1.624 ± 1.07
1.083TyrGly: 1.083 ± 0.802
0.541TyrHis: 0.541 ± 0.357
1.624TyrIle: 1.624 ± 0.641
2.166TyrLys: 2.166 ± 0.728
4.873TyrLeu: 4.873 ± 1.026
1.083TyrMet: 1.083 ± 0.522
2.166TyrAsn: 2.166 ± 0.659
1.624TyrPro: 1.624 ± 0.641
2.707TyrGln: 2.707 ± 1.058
1.624TyrArg: 1.624 ± 0.641
0.0TyrSer: 0.0 ± 0.0
0.541TyrThr: 0.541 ± 0.357
0.0TyrVal: 0.0 ± 0.0
1.624TyrTrp: 1.624 ± 0.641
0.541TyrTyr: 0.541 ± 0.357
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1848 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski