Amino acid dipeptide frequency for Duck hepatitis B virus (isolate white Shanghai duck S31) (DHBV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
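The uniform baseline described above can be sketched in a few lines of Python. This is only an illustration: the function name `classify` is hypothetical, and it assumes that the red/blue marking uses the flat 1/400 expectation as its threshold, which the page implies but does not state explicitly. The sample values are taken from the table below.

```python
# Uniform expectation: 20 x 20 = 400 standard dipeptides, each 1/400 of the total.
N_STANDARD = 20
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25 %


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform baseline (assumed threshold)."""
    if freq_per_mille > expected:
        return "over"      # would be marked in red
    if freq_per_mille < expected:
        return "under"     # would be marked in blue
    return "expected"


# A few values copied from the table (in per mille).
sample = {"LeuLeu": 17.288, "AlaAla": 3.782, "CysCys": 0.0, "AlaAsp": 0.54}
labels = {pair: classify(value) for pair, value in sample.items()}
```

With this baseline, LeuLeu and AlaAla come out as overrepresented, while CysCys and AlaAsp come out as underrepresented.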

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
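As a rough sketch of how such per mille values could be obtained, the snippet below counts overlapping residue pairs and scales the result by 1000. It is not the database's actual pipeline: the function names are hypothetical, sequences are written in one-letter codes rather than the three-letter codes used in the table, and pairs are assumed to be counted within each protein only (never across protein boundaries).

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein separately (assumption).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Toy input, not real DHBV sequences.
toy = ["MALLS", "LLSA"]
freqs = dipeptide_per_mille(toy)
```

In the toy input there are seven pairs in total, two of which are `LL`, so `freqs["LL"]` is 2000/7 per mille. Applied to a table entry, `per_mille_to_fraction(3.782)` gives a fraction of about 0.003782.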

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.782 ± 0.784
AlaCys: 1.621 ± 0.723
AlaAsp: 0.54 ± 0.355
AlaGlu: 2.701 ± 1.217
AlaPhe: 2.701 ± 0.969
AlaGly: 9.184 ± 3.782
AlaHis: 1.08 ± 0.706
AlaIle: 3.241 ± 1.109
AlaLys: 5.943 ± 1.221
AlaLeu: 7.023 ± 2.423
AlaMet: 1.08 ± 0.709
AlaAsn: 2.701 ± 1.217
AlaPro: 5.402 ± 1.46
AlaGln: 2.161 ± 1.099
AlaArg: 4.862 ± 0.833
AlaSer: 2.701 ± 0.818
AlaThr: 5.402 ± 0.928
AlaVal: 3.241 ± 0.463
AlaTrp: 1.08 ± 0.429
AlaTyr: 1.621 ± 0.552
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.621 ± 0.723
CysCys: 0.0 ± 0.0
CysAsp: 0.54 ± 0.355
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.54 ± 0.355
CysHis: 1.08 ± 0.709
CysIle: 1.08 ± 0.709
CysLys: 1.08 ± 0.709
CysLeu: 2.161 ± 1.014
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.701 ± 1.604
CysGln: 0.54 ± 0.584
CysArg: 0.0 ± 0.0
CysSer: 1.08 ± 0.709
CysThr: 1.08 ± 0.553
CysVal: 0.0 ± 0.0
CysTrp: 0.54 ± 0.355
CysTyr: 0.54 ± 0.355
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.322 ± 1.753
AspCys: 0.0 ± 0.0
AspAsp: 3.241 ± 1.177
AspGlu: 0.0 ± 0.0
AspPhe: 3.241 ± 1.104
AspGly: 1.621 ± 1.064
AspHis: 1.08 ± 1.052
AspIle: 2.701 ± 0.818
AspLys: 1.621 ± 1.064
AspLeu: 5.943 ± 2.296
AspMet: 0.0 ± 0.0
AspAsn: 1.621 ± 0.723
AspPro: 0.54 ± 0.526
AspGln: 2.701 ± 0.969
AspArg: 1.08 ± 0.706
AspSer: 2.701 ± 0.961
AspThr: 2.701 ± 0.818
AspVal: 1.08 ± 0.429
AspTrp: 2.161 ± 0.686
AspTyr: 1.08 ± 0.706
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.483 ± 1.336
GluCys: 0.54 ± 0.355
GluAsp: 2.161 ± 0.535
GluGlu: 8.644 ± 1.736
GluPhe: 0.0 ± 0.0
GluGly: 0.54 ± 0.526
GluHis: 0.0 ± 0.0
GluIle: 3.782 ± 1.281
GluLys: 2.161 ± 0.601
GluLeu: 3.241 ± 1.526
GluMet: 0.54 ± 0.355
GluAsn: 2.161 ± 0.868
GluPro: 3.782 ± 1.358
GluGln: 0.0 ± 0.0
GluArg: 3.241 ± 1.177
GluSer: 4.322 ± 0.974
GluThr: 1.621 ± 0.892
GluVal: 0.54 ± 0.526
GluTrp: 0.54 ± 0.355
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.54 ± 0.355
PheCys: 1.08 ± 0.91
PheAsp: 1.08 ± 0.709
PheGlu: 0.0 ± 0.0
PhePhe: 2.161 ± 1.099
PheGly: 2.701 ± 1.802
PheHis: 0.0 ± 0.0
PheIle: 1.08 ± 0.91
PheLys: 1.621 ± 0.552
PheLeu: 5.943 ± 2.034
PheMet: 0.54 ± 0.355
PheAsn: 1.08 ± 0.709
PhePro: 2.701 ± 0.352
PheGln: 2.161 ± 1.099
PheArg: 1.08 ± 0.429
PheSer: 3.782 ± 1.349
PheThr: 2.161 ± 0.995
PheVal: 4.322 ± 1.202
PheTrp: 1.621 ± 0.552
PheTyr: 1.621 ± 0.723
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.322 ± 1.115
GlyCys: 1.621 ± 0.723
GlyAsp: 1.621 ± 0.836
GlyGlu: 2.701 ± 0.352
GlyPhe: 3.241 ± 2.729
GlyGly: 4.862 ± 0.416
GlyHis: 0.54 ± 0.355
GlyIle: 2.701 ± 0.536
GlyLys: 6.483 ± 1.859
GlyLeu: 7.563 ± 3.214
GlyMet: 1.621 ± 0.552
GlyAsn: 2.161 ± 1.419
GlyPro: 0.54 ± 0.355
GlyGln: 2.161 ± 0.868
GlyArg: 6.483 ± 1.248
GlySer: 2.161 ± 0.601
GlyThr: 3.782 ± 0.776
GlyVal: 2.161 ± 1.419
GlyTrp: 0.54 ± 0.355
GlyTyr: 1.621 ± 0.552
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.621 ± 0.552
HisCys: 0.0 ± 0.0
HisAsp: 0.54 ± 0.355
HisGlu: 2.701 ± 1.217
HisPhe: 2.161 ± 0.601
HisGly: 0.54 ± 0.355
HisHis: 2.701 ± 0.818
HisIle: 1.08 ± 0.709
HisLys: 0.54 ± 0.355
HisLeu: 5.943 ± 2.106
HisMet: 0.54 ± 0.355
HisAsn: 0.0 ± 0.0
HisPro: 1.621 ± 0.585
HisGln: 1.621 ± 0.552
HisArg: 1.621 ± 0.552
HisSer: 0.54 ± 0.355
HisThr: 0.54 ± 0.355
HisVal: 2.701 ± 0.352
HisTrp: 0.54 ± 0.355
HisTyr: 2.161 ± 1.099
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.701 ± 1.217
IleCys: 0.0 ± 0.0
IleAsp: 2.701 ± 0.352
IleGlu: 3.782 ± 0.951
IlePhe: 1.621 ± 0.983
IleGly: 2.161 ± 0.686
IleHis: 1.621 ± 0.552
IleIle: 2.161 ± 1.819
IleLys: 4.322 ± 0.512
IleLeu: 7.023 ± 3.919
IleMet: 0.54 ± 0.355
IleAsn: 3.782 ± 1.098
IlePro: 3.782 ± 1.358
IleGln: 3.782 ± 1.098
IleArg: 2.161 ± 1.419
IleSer: 8.104 ± 1.573
IleThr: 4.322 ± 1.467
IleVal: 2.161 ± 1.014
IleTrp: 1.08 ± 0.91
IleTyr: 0.54 ± 0.355
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.241 ± 0.463
LysCys: 0.54 ± 0.355
LysAsp: 1.08 ± 0.706
LysGlu: 1.08 ± 0.709
LysPhe: 1.621 ± 0.585
LysGly: 4.322 ± 1.278
LysHis: 3.782 ± 1.098
LysIle: 5.402 ± 0.561
LysLys: 2.701 ± 0.906
LysLeu: 4.322 ± 1.278
LysMet: 2.701 ± 1.39
LysAsn: 2.701 ± 0.818
LysPro: 3.782 ± 1.429
LysGln: 2.161 ± 1.419
LysArg: 1.08 ± 0.709
LysSer: 6.483 ± 0.791
LysThr: 3.241 ± 1.177
LysVal: 2.701 ± 0.818
LysTrp: 0.54 ± 0.355
LysTyr: 3.782 ± 1.429
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.943 ± 1.202
LeuCys: 1.08 ± 0.709
LeuAsp: 3.782 ± 0.776
LeuGlu: 5.402 ± 1.46
LeuPhe: 5.402 ± 2.207
LeuGly: 5.402 ± 0.58
LeuHis: 1.08 ± 0.709
LeuIle: 7.563 ± 4.038
LeuLys: 2.701 ± 0.818
LeuLeu: 17.288 ± 7.724
LeuMet: 1.621 ± 0.723
LeuAsn: 2.701 ± 1.19
LeuPro: 7.563 ± 1.438
LeuGln: 3.782 ± 1.313
LeuArg: 8.104 ± 0.954
LeuSer: 10.265 ± 1.976
LeuThr: 5.402 ± 1.329
LeuVal: 8.644 ± 2.339
LeuTrp: 3.241 ± 1.608
LeuTyr: 4.862 ± 0.949
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.621 ± 0.723
MetCys: 0.0 ± 0.0
MetAsp: 2.161 ± 0.535
MetGlu: 0.54 ± 0.526
MetPhe: 1.08 ± 0.91
MetGly: 2.161 ± 0.868
MetHis: 1.08 ± 0.706
MetIle: 1.08 ± 0.429
MetLys: 0.54 ± 0.355
MetLeu: 1.08 ± 0.709
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.08 ± 0.709
MetGln: 1.621 ± 0.723
MetArg: 1.08 ± 0.709
MetSer: 1.08 ± 0.91
MetThr: 1.08 ± 0.91
MetVal: 1.08 ± 0.709
MetTrp: 0.54 ± 0.584
MetTyr: 0.54 ± 0.355
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.161 ± 0.601
AsnCys: 1.621 ± 0.552
AsnAsp: 0.54 ± 0.355
AsnGlu: 1.621 ± 0.552
AsnPhe: 1.621 ± 1.064
AsnGly: 1.621 ± 1.064
AsnHis: 0.54 ± 0.355
AsnIle: 0.54 ± 0.355
AsnLys: 1.621 ± 0.585
AsnLeu: 2.161 ± 0.995
AsnMet: 1.621 ± 1.657
AsnAsn: 1.08 ± 0.709
AsnPro: 2.701 ± 0.963
AsnGln: 1.08 ± 0.429
AsnArg: 2.161 ± 1.419
AsnSer: 2.161 ± 1.419
AsnThr: 1.621 ± 0.585
AsnVal: 4.322 ± 1.202
AsnTrp: 0.54 ± 0.355
AsnTyr: 1.08 ± 0.706
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.402 ± 1.926
ProCys: 0.0 ± 0.0
ProAsp: 4.322 ± 1.202
ProGlu: 3.782 ± 1.144
ProPhe: 1.08 ± 0.709
ProGly: 2.161 ± 1.1
ProHis: 2.701 ± 0.818
ProIle: 3.241 ± 1.104
ProLys: 3.782 ± 0.776
ProLeu: 8.104 ± 1.403
ProMet: 1.08 ± 0.709
ProAsn: 2.161 ± 1.419
ProPro: 3.782 ± 1.884
ProGln: 3.782 ± 1.144
ProArg: 7.563 ± 2.196
ProSer: 5.402 ± 0.561
ProThr: 5.402 ± 2.191
ProVal: 3.241 ± 0.463
ProTrp: 2.161 ± 1.1
ProTyr: 1.621 ± 0.552
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.08 ± 0.709
GlnCys: 1.08 ± 0.91
GlnAsp: 1.08 ± 0.91
GlnGlu: 3.241 ± 0.664
GlnPhe: 0.54 ± 0.355
GlnGly: 3.241 ± 1.865
GlnHis: 2.701 ± 0.818
GlnIle: 1.621 ± 0.723
GlnLys: 2.701 ± 0.969
GlnLeu: 2.701 ± 0.969
GlnMet: 0.54 ± 0.355
GlnAsn: 0.54 ± 0.526
GlnPro: 3.241 ± 1.381
GlnGln: 1.621 ± 1.578
GlnArg: 1.621 ± 0.552
GlnSer: 2.161 ± 1.419
GlnThr: 3.241 ± 1.093
GlnVal: 3.241 ± 1.109
GlnTrp: 1.621 ± 1.317
GlnTyr: 0.54 ± 0.355
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.563 ± 2.843
ArgCys: 0.54 ± 0.355
ArgAsp: 3.241 ± 1.104
ArgGlu: 3.782 ± 0.745
ArgPhe: 2.161 ± 1.419
ArgGly: 3.782 ± 1.098
ArgHis: 1.08 ± 0.709
ArgIle: 6.483 ± 1.373
ArgLys: 5.402 ± 2.565
ArgLeu: 8.104 ± 0.651
ArgMet: 1.08 ± 0.429
ArgAsn: 1.621 ± 1.064
ArgPro: 2.701 ± 0.352
ArgGln: 0.0 ± 0.0
ArgArg: 13.506 ± 2.251
ArgSer: 5.402 ± 1.631
ArgThr: 2.701 ± 0.818
ArgVal: 0.54 ± 0.355
ArgTrp: 0.54 ± 0.355
ArgTyr: 1.621 ± 0.585
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.943 ± 1.841
SerCys: 1.621 ± 0.723
SerAsp: 3.782 ± 0.297
SerGlu: 0.54 ± 0.355
SerPhe: 2.701 ± 0.961
SerGly: 2.701 ± 0.352
SerHis: 2.701 ± 0.818
SerIle: 4.862 ± 1.077
SerLys: 4.862 ± 2.102
SerLeu: 8.104 ± 2.038
SerMet: 1.08 ± 0.429
SerAsn: 1.08 ± 0.709
SerPro: 11.885 ± 0.72
SerGln: 1.08 ± 0.709
SerArg: 8.104 ± 2.455
SerSer: 14.587 ± 2.365
SerThr: 3.782 ± 1.213
SerVal: 2.701 ± 0.818
SerTrp: 0.54 ± 0.355
SerTyr: 0.54 ± 0.355
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.241 ± 1.104
ThrCys: 0.54 ± 0.355
ThrAsp: 1.621 ± 0.723
ThrGlu: 1.08 ± 0.709
ThrPhe: 4.322 ± 0.467
ThrGly: 3.782 ± 0.297
ThrHis: 2.701 ± 0.352
ThrIle: 4.322 ± 0.759
ThrLys: 0.54 ± 0.355
ThrLeu: 5.943 ± 2.324
ThrMet: 1.621 ± 0.552
ThrAsn: 3.241 ± 0.463
ThrPro: 5.402 ± 0.95
ThrGln: 3.241 ± 1.89
ThrArg: 3.241 ± 1.104
ThrSer: 4.862 ± 0.627
ThrThr: 8.104 ± 1.23
ThrVal: 1.621 ± 0.543
ThrTrp: 1.621 ± 0.836
ThrTyr: 2.161 ± 0.686
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.782 ± 1.586
ValCys: 2.701 ± 1.308
ValAsp: 3.241 ± 1.109
ValGlu: 0.54 ± 0.355
ValPhe: 0.0 ± 0.0
ValGly: 3.782 ± 1.349
ValHis: 0.0 ± 0.0
ValIle: 2.701 ± 0.352
ValLys: 2.161 ± 0.601
ValLeu: 2.161 ± 0.601
ValMet: 0.54 ± 0.355
ValAsn: 2.161 ± 0.868
ValPro: 4.322 ± 0.853
ValGln: 1.621 ± 0.585
ValArg: 3.241 ± 0.463
ValSer: 3.782 ± 0.297
ValThr: 3.782 ± 0.776
ValVal: 1.08 ± 0.709
ValTrp: 0.54 ± 0.355
ValTyr: 3.241 ± 1.658
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.621 ± 0.723
TrpPhe: 0.0 ± 0.0
TrpGly: 2.701 ± 0.764
TrpHis: 1.621 ± 0.552
TrpIle: 1.08 ± 0.91
TrpLys: 3.782 ± 0.745
TrpLeu: 1.621 ± 0.552
TrpMet: 1.08 ± 0.91
TrpAsn: 0.54 ± 0.584
TrpPro: 1.621 ± 0.585
TrpGln: 1.08 ± 0.706
TrpArg: 0.0 ± 0.0
TrpSer: 1.08 ± 0.429
TrpThr: 2.701 ± 0.969
TrpVal: 0.0 ± 0.0
TrpTrp: 3.241 ± 1.89
TrpTyr: 1.08 ± 0.709
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.701 ± 0.764
TyrCys: 0.0 ± 0.0
TyrAsp: 2.161 ± 1.014
TyrGlu: 0.54 ± 0.355
TyrPhe: 1.621 ± 1.064
TyrGly: 1.08 ± 0.706
TyrHis: 1.621 ± 1.064
TyrIle: 1.621 ± 0.552
TyrLys: 2.701 ± 0.818
TyrLeu: 5.402 ± 0.928
TyrMet: 1.08 ± 0.553
TyrAsn: 1.621 ± 0.552
TyrPro: 1.621 ± 0.552
TyrGln: 2.161 ± 0.858
TyrArg: 1.621 ± 0.552
TyrSer: 0.54 ± 0.355
TyrThr: 0.54 ± 0.355
TyrVal: 0.0 ± 0.0
TyrTrp: 1.621 ± 0.552
TyrTyr: 0.54 ± 0.355
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1852 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
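A protein-level bootstrap of this kind might look roughly like the sketch below. It is an illustration under stated assumptions, not the database's code: the function names are hypothetical, the reported error is assumed to be the standard deviation across replicates, sequences use one-letter codes, and the fixed seed is there only to make the example reproducible.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs in all proteins."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    # Assumption: the "±" value is the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


# Toy input, not real DHBV sequences.
toy = ["MALLS", "LLSA", "GGGG"]
err_ll = bootstrap_error(toy, "LL")
err_absent = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs has a frequency of zero in every replicate, so its estimated error is also zero, which matches the `0.0 ± 0.0` entries in the table. With only a handful of proteins, as here, the resampled sets differ strongly from one another, so relatively large errors are to be expected.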

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski