Amino acid dipepetide frequency for Duck hepatitis B virus (strain China) (DHBV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.245AlaAla: 3.245 ± 0.613
1.622AlaCys: 1.622 ± 0.59
0.541AlaAsp: 0.541 ± 0.365
3.245AlaGlu: 3.245 ± 1.192
2.704AlaPhe: 2.704 ± 0.954
8.112AlaGly: 8.112 ± 2.672
1.082AlaHis: 1.082 ± 0.797
3.245AlaIle: 3.245 ± 1.057
5.949AlaLys: 5.949 ± 1.202
7.031AlaLeu: 7.031 ± 2.023
1.082AlaMet: 1.082 ± 0.731
2.704AlaAsn: 2.704 ± 1.359
4.867AlaPro: 4.867 ± 1.788
2.163AlaGln: 2.163 ± 1.142
5.949AlaArg: 5.949 ± 1.202
3.786AlaSer: 3.786 ± 0.314
5.408AlaThr: 5.408 ± 0.943
3.786AlaVal: 3.786 ± 0.608
1.082AlaTrp: 1.082 ± 0.461
1.622AlaTyr: 1.622 ± 0.596
0.0AlaXaa: 0.0 ± 0.0
Cys
1.622CysAla: 1.622 ± 0.719
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.541CysGly: 0.541 ± 0.365
1.082CysHis: 1.082 ± 0.731
1.082CysIle: 1.082 ± 0.731
0.541CysLys: 0.541 ± 0.365
2.163CysLeu: 2.163 ± 1.046
0.0CysMet: 0.0 ± 0.0
1.082CysAsn: 1.082 ± 0.731
2.704CysPro: 2.704 ± 1.564
0.541CysGln: 0.541 ± 0.584
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.622CysThr: 1.622 ± 0.59
0.0CysVal: 0.0 ± 0.0
0.541CysTrp: 0.541 ± 0.365
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.327AspAla: 4.327 ± 1.941
0.0AspCys: 0.0 ± 0.0
3.245AspAsp: 3.245 ± 1.303
0.0AspGlu: 0.0 ± 0.0
3.245AspPhe: 3.245 ± 1.192
1.622AspGly: 1.622 ± 1.096
1.082AspHis: 1.082 ± 0.976
2.163AspIle: 2.163 ± 0.585
1.622AspLys: 1.622 ± 1.096
5.949AspLeu: 5.949 ± 2.53
0.0AspMet: 0.0 ± 0.0
1.622AspAsn: 1.622 ± 0.719
1.622AspPro: 1.622 ± 0.674
2.704AspGln: 2.704 ± 0.954
1.622AspArg: 1.622 ± 0.596
3.786AspSer: 3.786 ± 1.032
2.163AspThr: 2.163 ± 0.727
1.082AspVal: 1.082 ± 0.461
2.163AspTrp: 2.163 ± 0.727
1.082AspTyr: 1.082 ± 0.797
0.0AspXaa: 0.0 ± 0.0
Glu
5.408GluAla: 5.408 ± 1.618
0.541GluCys: 0.541 ± 0.365
2.704GluAsp: 2.704 ± 0.386
8.112GluGlu: 8.112 ± 2.134
0.0GluPhe: 0.0 ± 0.0
0.541GluGly: 0.541 ± 0.488
0.541GluHis: 0.541 ± 0.365
5.949GluIle: 5.949 ± 1.76
2.163GluLys: 2.163 ± 0.585
2.163GluLeu: 2.163 ± 1.462
0.541GluMet: 0.541 ± 0.365
1.622GluAsn: 1.622 ± 1.096
3.245GluPro: 3.245 ± 1.303
0.541GluGln: 0.541 ± 0.365
2.704GluArg: 2.704 ± 1.597
4.327GluSer: 4.327 ± 0.892
1.622GluThr: 1.622 ± 0.877
0.0GluVal: 0.0 ± 0.0
0.541GluTrp: 0.541 ± 0.365
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.082PheAla: 1.082 ± 0.731
0.0PheCys: 0.0 ± 0.0
1.082PheAsp: 1.082 ± 0.731
0.0PheGlu: 0.0 ± 0.0
3.245PhePhe: 3.245 ± 1.875
2.704PheGly: 2.704 ± 1.862
0.0PheHis: 0.0 ± 0.0
1.082PheIle: 1.082 ± 0.879
2.704PheLys: 2.704 ± 0.797
5.949PheLeu: 5.949 ± 2.11
0.541PheMet: 0.541 ± 0.365
1.082PheAsn: 1.082 ± 0.731
2.163PhePro: 2.163 ± 0.585
2.163PheGln: 2.163 ± 1.142
1.082PheArg: 1.082 ± 0.461
3.786PheSer: 3.786 ± 1.471
2.163PheThr: 2.163 ± 0.867
3.245PheVal: 3.245 ± 1.057
1.622PheTrp: 1.622 ± 0.596
0.541PheTyr: 0.541 ± 0.365
0.0PheXaa: 0.0 ± 0.0
Gly
4.867GlyAla: 4.867 ± 1.585
2.163GlyCys: 2.163 ± 0.727
2.704GlyAsp: 2.704 ± 0.386
3.245GlyGlu: 3.245 ± 0.495
1.082GlyPhe: 1.082 ± 0.879
4.327GlyGly: 4.327 ± 0.456
0.541GlyHis: 0.541 ± 0.365
2.704GlyIle: 2.704 ± 0.6
5.949GlyLys: 5.949 ± 1.854
7.031GlyLeu: 7.031 ± 3.29
1.622GlyMet: 1.622 ± 0.596
2.163GlyAsn: 2.163 ± 1.462
0.541GlyPro: 0.541 ± 0.365
1.622GlyGln: 1.622 ± 0.674
6.49GlyArg: 6.49 ± 1.375
4.867GlySer: 4.867 ± 1.07
2.704GlyThr: 2.704 ± 0.797
1.622GlyVal: 1.622 ± 1.096
0.0GlyTrp: 0.0 ± 0.0
1.622GlyTyr: 1.622 ± 0.596
0.0GlyXaa: 0.0 ± 0.0
His
1.622HisAla: 1.622 ± 0.596
0.0HisCys: 0.0 ± 0.0
0.541HisAsp: 0.541 ± 0.365
2.704HisGlu: 2.704 ± 1.359
2.163HisPhe: 2.163 ± 0.585
0.541HisGly: 0.541 ± 0.365
2.704HisHis: 2.704 ± 0.773
1.082HisIle: 1.082 ± 0.731
1.082HisLys: 1.082 ± 0.731
5.408HisLeu: 5.408 ± 1.816
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.622HisPro: 1.622 ± 0.674
2.163HisGln: 2.163 ± 0.585
1.622HisArg: 1.622 ± 0.596
1.082HisSer: 1.082 ± 0.731
0.0HisThr: 0.0 ± 0.0
2.163HisVal: 2.163 ± 0.566
0.0HisTrp: 0.0 ± 0.0
2.163HisTyr: 2.163 ± 1.142
0.0HisXaa: 0.0 ± 0.0
Ile
2.704IleAla: 2.704 ± 1.359
0.0IleCys: 0.0 ± 0.0
2.704IleAsp: 2.704 ± 0.386
3.245IleGlu: 3.245 ± 1.303
1.622IlePhe: 1.622 ± 1.063
1.082IleGly: 1.082 ± 0.879
1.622IleHis: 1.622 ± 0.596
1.082IleIle: 1.082 ± 0.879
5.408IleLys: 5.408 ± 0.65
7.572IleLeu: 7.572 ± 4.092
0.0IleMet: 0.0 ± 0.0
4.867IleAsn: 4.867 ± 0.863
4.867IlePro: 4.867 ± 1.388
3.245IleGln: 3.245 ± 1.192
1.622IleArg: 1.622 ± 1.096
9.194IleSer: 9.194 ± 1.967
4.327IleThr: 4.327 ± 1.461
2.163IleVal: 2.163 ± 1.046
1.082IleTrp: 1.082 ± 0.879
0.541IleTyr: 0.541 ± 0.365
0.0IleXaa: 0.0 ± 0.0
Lys
4.327LysAla: 4.327 ± 1.112
0.541LysCys: 0.541 ± 0.365
1.622LysAsp: 1.622 ± 0.596
1.082LysGlu: 1.082 ± 0.731
1.082LysPhe: 1.082 ± 0.731
4.327LysGly: 4.327 ± 1.439
4.327LysHis: 4.327 ± 1.17
4.867LysIle: 4.867 ± 0.813
2.704LysLys: 2.704 ± 0.873
6.49LysLeu: 6.49 ± 2.058
2.704LysMet: 2.704 ± 1.317
2.163LysAsn: 2.163 ± 0.727
3.786LysPro: 3.786 ± 1.381
1.622LysGln: 1.622 ± 1.096
1.622LysArg: 1.622 ± 1.096
5.949LysSer: 5.949 ± 1.062
3.245LysThr: 3.245 ± 1.303
3.245LysVal: 3.245 ± 1.057
1.082LysTrp: 1.082 ± 0.731
3.786LysTyr: 3.786 ± 0.78
0.0LysXaa: 0.0 ± 0.0
Leu
6.49LeuAla: 6.49 ± 1.074
1.082LeuCys: 1.082 ± 0.731
3.786LeuAsp: 3.786 ± 0.708
5.949LeuGlu: 5.949 ± 1.062
5.408LeuPhe: 5.408 ± 2.156
4.867LeuGly: 4.867 ± 0.794
1.082LeuHis: 1.082 ± 0.731
8.112LeuIle: 8.112 ± 3.764
3.245LeuLys: 3.245 ± 0.495
16.766LeuLeu: 16.766 ± 7.085
2.163LeuMet: 2.163 ± 0.727
2.704LeuAsn: 2.704 ± 1.319
7.572LeuPro: 7.572 ± 1.388
3.245LeuGln: 3.245 ± 1.33
7.572LeuArg: 7.572 ± 0.954
9.735LeuSer: 9.735 ± 1.563
5.949LeuThr: 5.949 ± 1.406
8.653LeuVal: 8.653 ± 2.413
3.245LeuTrp: 3.245 ± 1.762
4.867LeuTyr: 4.867 ± 1.061
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
2.163MetAsp: 2.163 ± 0.566
0.541MetGlu: 0.541 ± 0.488
1.082MetPhe: 1.082 ± 0.879
2.163MetGly: 2.163 ± 0.982
1.082MetHis: 1.082 ± 0.797
1.082MetIle: 1.082 ± 0.461
1.082MetLys: 1.082 ± 0.731
1.082MetLeu: 1.082 ± 0.731
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.082MetPro: 1.082 ± 0.731
2.163MetGln: 2.163 ± 0.727
0.541MetArg: 0.541 ± 0.365
1.082MetSer: 1.082 ± 0.879
1.082MetThr: 1.082 ± 0.879
1.082MetVal: 1.082 ± 0.731
0.541MetTrp: 0.541 ± 0.584
0.541MetTyr: 0.541 ± 0.365
0.0MetXaa: 0.0 ± 0.0
Asn
2.163AsnAla: 2.163 ± 0.585
1.082AsnCys: 1.082 ± 0.797
0.0AsnAsp: 0.0 ± 0.0
1.622AsnGlu: 1.622 ± 0.596
1.622AsnPhe: 1.622 ± 1.096
2.163AsnGly: 2.163 ± 0.727
0.541AsnHis: 0.541 ± 0.365
0.0AsnIle: 0.0 ± 0.0
1.622AsnLys: 1.622 ± 1.096
2.704AsnLeu: 2.704 ± 1.193
1.082AsnMet: 1.082 ± 0.659
1.082AsnAsn: 1.082 ± 0.731
3.245AsnPro: 3.245 ± 1.349
1.082AsnGln: 1.082 ± 0.461
1.622AsnArg: 1.622 ± 1.096
1.082AsnSer: 1.082 ± 0.731
1.622AsnThr: 1.622 ± 0.674
4.327AsnVal: 4.327 ± 1.17
0.541AsnTrp: 0.541 ± 0.365
1.082AsnTyr: 1.082 ± 0.797
0.0AsnXaa: 0.0 ± 0.0
Pro
4.327ProAla: 4.327 ± 1.761
0.541ProCys: 0.541 ± 0.365
4.327ProAsp: 4.327 ± 1.17
3.786ProGlu: 3.786 ± 1.092
1.082ProPhe: 1.082 ± 0.731
2.704ProGly: 2.704 ± 0.954
2.163ProHis: 2.163 ± 0.727
2.163ProIle: 2.163 ± 1.594
4.327ProLys: 4.327 ± 0.456
8.112ProLeu: 8.112 ± 1.384
1.622ProMet: 1.622 ± 1.096
2.163ProAsn: 2.163 ± 1.462
3.786ProPro: 3.786 ± 1.753
3.786ProGln: 3.786 ± 1.092
7.031ProArg: 7.031 ± 2.287
6.49ProSer: 6.49 ± 0.21
5.408ProThr: 5.408 ± 1.845
3.786ProVal: 3.786 ± 0.78
2.163ProTrp: 2.163 ± 1.028
1.622ProTyr: 1.622 ± 0.596
0.0ProXaa: 0.0 ± 0.0
Gln
1.082GlnAla: 1.082 ± 0.731
1.082GlnCys: 1.082 ± 0.879
1.082GlnAsp: 1.082 ± 0.879
3.245GlnGlu: 3.245 ± 0.658
0.541GlnPhe: 0.541 ± 0.365
3.245GlnGly: 3.245 ± 2.055
2.163GlnHis: 2.163 ± 0.585
3.245GlnIle: 3.245 ± 1.163
2.163GlnLys: 2.163 ± 1.028
3.245GlnLeu: 3.245 ± 1.015
0.541GlnMet: 0.541 ± 0.365
0.541GlnAsn: 0.541 ± 0.488
3.786GlnPro: 3.786 ± 1.092
1.622GlnGln: 1.622 ± 1.464
2.163GlnArg: 2.163 ± 0.585
2.163GlnSer: 2.163 ± 1.462
3.245GlnThr: 3.245 ± 1.03
1.622GlnVal: 1.622 ± 0.596
1.622GlnTrp: 1.622 ± 1.213
0.541GlnTyr: 0.541 ± 0.365
0.0GlnXaa: 0.0 ± 0.0
Arg
8.112ArgAla: 8.112 ± 2.98
0.541ArgCys: 0.541 ± 0.365
3.786ArgAsp: 3.786 ± 1.123
2.704ArgGlu: 2.704 ± 0.386
2.163ArgPhe: 2.163 ± 1.462
4.867ArgGly: 4.867 ± 1.321
1.082ArgHis: 1.082 ± 0.731
6.49ArgIle: 6.49 ± 1.241
7.031ArgLys: 7.031 ± 2.365
6.49ArgLeu: 6.49 ± 0.787
0.541ArgMet: 0.541 ± 0.488
1.082ArgAsn: 1.082 ± 0.731
2.704ArgPro: 2.704 ± 0.386
0.541ArgGln: 0.541 ± 0.365
10.817ArgArg: 10.817 ± 3.392
4.867ArgSer: 4.867 ± 1.788
2.163ArgThr: 2.163 ± 0.585
1.622ArgVal: 1.622 ± 0.674
0.541ArgTrp: 0.541 ± 0.365
1.622ArgTyr: 1.622 ± 0.674
0.0ArgXaa: 0.0 ± 0.0
Ser
6.49SerAla: 6.49 ± 2.181
1.622SerCys: 1.622 ± 1.063
3.245SerAsp: 3.245 ± 0.483
1.082SerGlu: 1.082 ± 0.731
3.245SerPhe: 3.245 ± 1.179
3.786SerGly: 3.786 ± 0.708
2.704SerHis: 2.704 ± 0.773
5.408SerIle: 5.408 ± 1.139
4.867SerLys: 4.867 ± 1.461
8.653SerLeu: 8.653 ± 2.113
1.082SerMet: 1.082 ± 0.461
1.082SerAsn: 1.082 ± 0.731
9.735SerPro: 9.735 ± 1.625
1.082SerGln: 1.082 ± 0.731
8.112SerArg: 8.112 ± 2.318
15.684SerSer: 15.684 ± 2.684
5.408SerThr: 5.408 ± 0.401
2.163SerVal: 2.163 ± 0.727
0.541SerTrp: 0.541 ± 0.365
1.082SerTyr: 1.082 ± 0.731
0.0SerXaa: 0.0 ± 0.0
Thr
3.786ThrAla: 3.786 ± 1.123
0.0ThrCys: 0.0 ± 0.0
2.163ThrAsp: 2.163 ± 0.727
1.082ThrGlu: 1.082 ± 0.731
3.786ThrPhe: 3.786 ± 1.471
3.245ThrGly: 3.245 ± 0.483
2.704ThrHis: 2.704 ± 0.386
4.327ThrIle: 4.327 ± 1.657
0.541ThrLys: 0.541 ± 0.365
6.49ThrLeu: 6.49 ± 2.182
1.622ThrMet: 1.622 ± 0.63
2.163ThrAsn: 2.163 ± 0.566
6.49ThrPro: 6.49 ± 1.351
3.245ThrGln: 3.245 ± 1.875
3.786ThrArg: 3.786 ± 1.123
3.786ThrSer: 3.786 ± 0.852
7.572ThrThr: 7.572 ± 1.203
2.163ThrVal: 2.163 ± 1.046
1.622ThrTrp: 1.622 ± 0.871
2.163ThrTyr: 2.163 ± 0.727
0.0ThrXaa: 0.0 ± 0.0
Val
3.786ValAla: 3.786 ± 1.568
2.704ValCys: 2.704 ± 1.193
3.245ValAsp: 3.245 ± 1.057
0.541ValGlu: 0.541 ± 0.365
0.541ValPhe: 0.541 ± 0.365
3.245ValGly: 3.245 ± 1.668
0.0ValHis: 0.0 ± 0.0
2.163ValIle: 2.163 ± 0.566
2.163ValLys: 2.163 ± 0.585
2.163ValLeu: 2.163 ± 0.585
0.541ValMet: 0.541 ± 0.365
0.541ValAsn: 0.541 ± 0.365
3.786ValPro: 3.786 ± 0.685
1.622ValGln: 1.622 ± 0.674
3.786ValArg: 3.786 ± 0.78
3.786ValSer: 3.786 ± 1.568
3.786ValThr: 3.786 ± 0.708
2.163ValVal: 2.163 ± 0.585
1.082ValTrp: 1.082 ± 0.731
3.245ValTyr: 3.245 ± 1.801
0.0ValXaa: 0.0 ± 0.0
Trp
1.082TrpAla: 1.082 ± 0.879
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.541TrpGlu: 0.541 ± 0.365
0.0TrpPhe: 0.0 ± 0.0
2.704TrpGly: 2.704 ± 0.797
1.622TrpHis: 1.622 ± 0.596
1.082TrpIle: 1.082 ± 0.879
3.786TrpLys: 3.786 ± 0.78
1.622TrpLeu: 1.622 ± 0.596
1.082TrpMet: 1.082 ± 0.879
0.541TrpAsn: 0.541 ± 0.584
1.622TrpPro: 1.622 ± 0.674
1.082TrpGln: 1.082 ± 0.797
0.0TrpArg: 0.0 ± 0.0
1.082TrpSer: 1.082 ± 0.461
2.704TrpThr: 2.704 ± 0.954
0.0TrpVal: 0.0 ± 0.0
3.245TrpTrp: 3.245 ± 1.875
1.082TrpTyr: 1.082 ± 0.731
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.704TyrAla: 2.704 ± 0.797
0.0TyrCys: 0.0 ± 0.0
2.163TyrAsp: 2.163 ± 1.046
0.541TyrGlu: 0.541 ± 0.365
1.622TyrPhe: 1.622 ± 1.096
1.082TyrGly: 1.082 ± 0.797
0.541TyrHis: 0.541 ± 0.365
1.622TyrIle: 1.622 ± 0.596
1.622TyrLys: 1.622 ± 1.096
5.408TyrLeu: 5.408 ± 0.943
1.082TyrMet: 1.082 ± 0.46
1.622TyrAsn: 1.622 ± 0.596
2.163TyrPro: 2.163 ± 0.566
2.704TyrGln: 2.704 ± 1.096
1.622TyrArg: 1.622 ± 0.596
0.541TyrSer: 0.541 ± 0.365
0.541TyrThr: 0.541 ± 0.365
0.0TyrVal: 0.0 ± 0.0
1.622TyrTrp: 1.622 ± 0.596
0.541TyrTyr: 0.541 ± 0.365
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1850 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski