Amino acid dipeptide frequency for Hepatitis B virus subtype adw2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
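As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the choices made here (pairs are counted within each protein only, and normalized by the total number of pairs) are assumptions, not necessarily the procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each sequence only, so no pair
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (hypothetical, not the HBV proteome).
toy = ["MLLLA", "ALLS"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation for 400 equally likely dipeptides: 2.5 per mille.
uniform_expectation = 1000.0 / 400

for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f} per mille")
```

With real data, `sequences` would hold the one-letter protein sequences of the proteome, and each value could be compared against `uniform_expectation`.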

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
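The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and using the uniform value of 2.5‰ (1/400) as the threshold between over- and underrepresented dipeptides is an assumption based on the description above, not a confirmed detail of how the table is colored.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def representation(value_per_mille, n_dipeptides=400):
    """Classify a dipeptide against the uniform expectation (assumed threshold)."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: LeuLeu and AlaTrp.
for name, value in [("LeuLeu", 18.08), ("AlaTrp", 0.43)]:
    print(name, per_mille_to_fraction(value), representation(value))
```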

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.013 ± 1.21
AlaCys: 0.861 ± 0.563
AlaAsp: 2.152 ± 1.087
AlaGlu: 1.722 ± 0.862
AlaPhe: 5.596 ± 1.205
AlaGly: 3.874 ± 0.697
AlaHis: 0.861 ± 1.229
AlaIle: 3.013 ± 0.472
AlaLys: 2.583 ± 0.666
AlaLeu: 5.166 ± 2.199
AlaMet: 1.722 ± 1.035
AlaAsn: 1.291 ± 0.817
AlaPro: 3.444 ± 1.175
AlaGln: 2.152 ± 0.764
AlaArg: 4.305 ± 1.426
AlaSer: 6.027 ± 1.36
AlaThr: 2.152 ± 1.257
AlaVal: 3.013 ± 1.326
AlaTrp: 0.43 ± 0.281
AlaTyr: 1.722 ± 0.893
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.291 ± 1.844
CysCys: 2.583 ± 1.295
CysAsp: 0.43 ± 0.281
CysGlu: 0.0 ± 0.0
CysPhe: 1.291 ± 0.844
CysGly: 0.861 ± 0.563
CysHis: 0.0 ± 0.0
CysIle: 2.152 ± 0.526
CysLys: 1.291 ± 0.647
CysLeu: 9.04 ± 1.867
CysMet: 1.291 ± 0.98
CysAsn: 0.43 ± 0.615
CysPro: 4.305 ± 1.92
CysGln: 1.722 ± 0.862
CysArg: 0.43 ± 0.615
CysSer: 3.013 ± 1.054
CysThr: 4.735 ± 1.721
CysVal: 0.43 ± 0.615
CysTrp: 1.291 ± 0.587
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.861 ± 0.563
AspCys: 0.43 ± 0.615
AspAsp: 1.291 ± 0.844
AspGlu: 0.861 ± 0.563
AspPhe: 1.722 ± 0.578
AspGly: 1.722 ± 0.519
AspHis: 2.152 ± 0.816
AspIle: 1.291 ± 0.587
AspLys: 0.861 ± 0.563
AspLeu: 3.013 ± 1.38
AspMet: 0.0 ± 0.0
AspAsn: 0.43 ± 0.281
AspPro: 4.735 ± 1.678
AspGln: 0.43 ± 0.281
AspArg: 0.861 ± 0.697
AspSer: 2.152 ± 0.764
AspThr: 0.861 ± 0.697
AspVal: 1.291 ± 0.681
AspTrp: 1.722 ± 0.616
AspTyr: 1.291 ± 0.647
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.722 ± 0.893
GluCys: 0.0 ± 0.0
GluAsp: 2.152 ± 0.732
GluGlu: 1.722 ± 1.169
GluPhe: 1.291 ± 0.587
GluGly: 0.0 ± 0.0
GluHis: 1.291 ± 0.587
GluIle: 0.43 ± 0.615
GluLys: 0.43 ± 0.281
GluLeu: 3.444 ± 1.32
GluMet: 0.0 ± 0.0
GluAsn: 1.291 ± 0.647
GluPro: 0.43 ± 0.281
GluGln: 0.43 ± 0.281
GluArg: 0.0 ± 0.0
GluSer: 3.444 ± 1.32
GluThr: 3.013 ± 1.38
GluVal: 0.0 ± 0.0
GluTrp: 1.722 ± 0.519
GluTyr: 1.722 ± 0.601
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.444 ± 1.118
PheCys: 2.152 ± 0.526
PheAsp: 0.0 ± 0.0
PheGlu: 0.0 ± 0.0
PhePhe: 4.305 ± 1.6
PheGly: 4.305 ± 1.757
PheHis: 2.583 ± 0.979
PheIle: 3.874 ± 1.942
PheLys: 1.291 ± 1.166
PheLeu: 9.04 ± 2.503
PheMet: 0.43 ± 0.281
PheAsn: 0.861 ± 0.433
PhePro: 4.735 ± 0.854
PheGln: 0.43 ± 0.281
PheArg: 1.722 ± 1.125
PheSer: 3.874 ± 0.939
PheThr: 3.013 ± 1.225
PheVal: 4.305 ± 0.841
PheTrp: 0.0 ± 0.0
PheTyr: 0.861 ± 0.563
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.444 ± 1.235
GlyCys: 0.861 ± 0.584
GlyAsp: 1.291 ± 0.505
GlyGlu: 1.722 ± 0.893
GlyPhe: 4.305 ± 1.528
GlyGly: 3.874 ± 1.474
GlyHis: 2.152 ± 1.362
GlyIle: 2.583 ± 1.334
GlyLys: 0.0 ± 0.0
GlyLeu: 7.318 ± 1.051
GlyMet: 2.152 ± 0.949
GlyAsn: 4.305 ± 0.96
GlyPro: 7.318 ± 1.358
GlyGln: 1.291 ± 0.647
GlyArg: 4.305 ± 1.125
GlySer: 5.166 ± 1.551
GlyThr: 5.596 ± 1.098
GlyVal: 3.013 ± 0.762
GlyTrp: 2.152 ± 0.824
GlyTyr: 2.583 ± 0.666
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.43 ± 0.281
HisCys: 2.583 ± 0.948
HisAsp: 0.43 ± 0.281
HisGlu: 0.0 ± 0.0
HisPhe: 0.861 ± 0.563
HisGly: 2.152 ± 0.858
HisHis: 1.291 ± 0.587
HisIle: 3.444 ± 0.688
HisLys: 1.291 ± 1.166
HisLeu: 3.874 ± 1.755
HisMet: 0.43 ± 0.657
HisAsn: 0.43 ± 0.281
HisPro: 1.722 ± 0.817
HisGln: 3.013 ± 0.744
HisArg: 0.43 ± 0.281
HisSer: 1.722 ± 0.519
HisThr: 2.152 ± 0.732
HisVal: 0.43 ± 0.281
HisTrp: 0.43 ± 0.43
HisTyr: 0.861 ± 0.563
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.861 ± 0.577
IleCys: 1.722 ± 0.519
IleAsp: 1.722 ± 0.601
IleGlu: 0.43 ± 0.281
IlePhe: 4.305 ± 1.779
IleGly: 1.291 ± 0.844
IleHis: 1.291 ± 0.844
IleIle: 2.583 ± 0.442
IleLys: 1.291 ± 0.59
IleLeu: 6.457 ± 1.152
IleMet: 1.291 ± 0.505
IleAsn: 0.43 ± 0.281
IlePro: 8.61 ± 2.84
IleGln: 0.43 ± 0.281
IleArg: 3.444 ± 1.32
IleSer: 3.444 ± 1.649
IleThr: 1.722 ± 0.519
IleVal: 2.583 ± 0.666
IleTrp: 1.291 ± 0.647
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.291 ± 0.844
LysCys: 0.0 ± 0.0
LysAsp: 1.291 ± 1.21
LysGlu: 0.861 ± 0.697
LysPhe: 0.43 ± 0.281
LysGly: 0.861 ± 0.433
LysHis: 0.43 ± 0.281
LysIle: 2.152 ± 0.732
LysLys: 0.0 ± 0.0
LysLeu: 3.013 ± 1.28
LysMet: 0.0 ± 0.0
LysAsn: 0.43 ± 0.281
LysPro: 2.583 ± 0.718
LysGln: 1.291 ± 0.844
LysArg: 2.152 ± 1.087
LysSer: 0.43 ± 0.281
LysThr: 2.583 ± 0.666
LysVal: 2.583 ± 1.027
LysTrp: 0.0 ± 0.0
LysTyr: 2.152 ± 0.526
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.444 ± 1.418
LeuCys: 3.874 ± 1.623
LeuAsp: 5.596 ± 0.811
LeuGlu: 3.444 ± 2.15
LeuPhe: 1.722 ± 0.751
LeuGly: 10.762 ± 1.799
LeuHis: 3.013 ± 1.591
LeuIle: 3.874 ± 0.937
LeuLys: 2.152 ± 0.836
LeuLeu: 18.08 ± 3.203
LeuMet: 1.722 ± 0.601
LeuAsn: 3.874 ± 1.375
LeuPro: 9.04 ± 1.36
LeuGln: 4.735 ± 1.129
LeuArg: 5.166 ± 1.514
LeuSer: 13.345 ± 1.463
LeuThr: 4.735 ± 1.117
LeuVal: 8.179 ± 1.063
LeuTrp: 4.305 ± 1.335
LeuTyr: 7.318 ± 1.289
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.43 ± 0.615
MetCys: 1.291 ± 0.647
MetAsp: 1.291 ± 0.587
MetGlu: 1.722 ± 0.793
MetPhe: 1.291 ± 0.647
MetGly: 2.583 ± 0.747
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.722 ± 0.519
MetMet: 1.291 ± 0.647
MetAsn: 0.43 ± 0.281
MetPro: 1.722 ± 1.125
MetGln: 1.722 ± 0.54
MetArg: 0.43 ± 0.281
MetSer: 0.861 ± 0.98
MetThr: 0.861 ± 0.697
MetVal: 0.0 ± 0.0
MetTrp: 1.291 ± 0.647
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.291 ± 1.027
AsnCys: 1.291 ± 0.647
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.43 ± 0.281
AsnPhe: 3.013 ± 0.553
AsnGly: 0.0 ± 0.0
AsnHis: 1.722 ± 0.519
AsnIle: 2.152 ± 1.111
AsnLys: 0.43 ± 0.281
AsnLeu: 4.735 ± 1.799
AsnMet: 1.722 ± 0.848
AsnAsn: 3.013 ± 0.793
AsnPro: 3.874 ± 1.594
AsnGln: 0.861 ± 0.433
AsnArg: 1.722 ± 0.817
AsnSer: 5.596 ± 1.531
AsnThr: 1.291 ± 0.587
AsnVal: 0.43 ± 0.281
AsnTrp: 0.43 ± 0.281
AsnTyr: 0.861 ± 0.697
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.471 ± 1.86
ProCys: 3.444 ± 0.961
ProAsp: 1.291 ± 0.817
ProGlu: 2.152 ± 1.257
ProPhe: 3.874 ± 1.006
ProGly: 3.874 ± 1.233
ProHis: 3.013 ± 0.793
ProIle: 7.318 ± 1.649
ProLys: 2.152 ± 0.563
ProLeu: 9.901 ± 1.725
ProMet: 0.861 ± 0.545
ProAsn: 3.013 ± 0.684
ProPro: 5.596 ± 1.835
ProGln: 2.583 ± 1.126
ProArg: 3.874 ± 1.743
ProSer: 12.914 ± 1.437
ProThr: 6.457 ± 1.48
ProVal: 5.596 ± 1.286
ProTrp: 1.291 ± 0.736
ProTyr: 1.722 ± 0.601
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.596 ± 1.639
GlnCys: 1.291 ± 0.587
GlnAsp: 1.291 ± 0.505
GlnGlu: 0.861 ± 0.563
GlnPhe: 1.722 ± 1.125
GlnGly: 3.874 ± 1.233
GlnHis: 1.291 ± 0.844
GlnIle: 0.43 ± 0.615
GlnLys: 0.43 ± 0.281
GlnLeu: 3.013 ± 1.069
GlnMet: 0.0 ± 0.0
GlnAsn: 1.722 ± 0.519
GlnPro: 2.152 ± 1.07
GlnGln: 0.43 ± 0.281
GlnArg: 1.722 ± 1.125
GlnSer: 6.888 ± 0.824
GlnThr: 0.43 ± 0.281
GlnVal: 1.291 ± 0.59
GlnTrp: 2.152 ± 1.111
GlnTyr: 0.861 ± 0.563
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.861 ± 0.563
ArgCys: 1.291 ± 0.681
ArgAsp: 3.444 ± 2.244
ArgGlu: 3.874 ± 1.76
ArgPhe: 3.444 ± 1.106
ArgGly: 4.305 ± 1.054
ArgHis: 1.291 ± 0.681
ArgIle: 2.152 ± 0.526
ArgLys: 2.583 ± 1.334
ArgLeu: 4.305 ± 2.174
ArgMet: 0.43 ± 0.281
ArgAsn: 0.861 ± 0.563
ArgPro: 3.444 ± 1.356
ArgGln: 3.874 ± 1.067
ArgArg: 11.623 ± 6.286
ArgSer: 5.166 ± 1.802
ArgThr: 3.874 ± 1.248
ArgVal: 2.152 ± 1.406
ArgTrp: 1.722 ± 0.519
ArgTyr: 0.43 ± 0.281
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.318 ± 1.852
SerCys: 5.166 ± 0.911
SerAsp: 2.152 ± 0.898
SerGlu: 0.43 ± 0.281
SerPhe: 3.874 ± 1.309
SerGly: 6.027 ± 1.287
SerHis: 3.444 ± 1.099
SerIle: 3.013 ± 0.882
SerLys: 2.152 ± 0.732
SerLeu: 9.04 ± 1.727
SerMet: 1.291 ± 0.647
SerAsn: 2.583 ± 0.718
SerPro: 14.636 ± 3.442
SerGln: 6.888 ± 1.46
SerArg: 7.318 ± 2.529
SerSer: 10.331 ± 2.049
SerThr: 5.596 ± 1.037
SerVal: 5.596 ± 1.127
SerTrp: 5.596 ± 1.512
SerTyr: 0.861 ± 0.563
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.305 ± 0.973
ThrCys: 3.013 ± 1.306
ThrAsp: 0.861 ± 0.584
ThrGlu: 0.43 ± 0.281
ThrPhe: 2.152 ± 0.732
ThrGly: 3.874 ± 0.977
ThrHis: 0.861 ± 0.563
ThrIle: 1.722 ± 0.9
ThrLys: 2.152 ± 0.526
ThrLeu: 3.444 ± 1.786
ThrMet: 0.43 ± 0.281
ThrAsn: 3.444 ± 1.013
ThrPro: 5.166 ± 1.041
ThrGln: 0.43 ± 0.281
ThrArg: 3.013 ± 0.876
ThrSer: 9.901 ± 2.211
ThrThr: 8.179 ± 2.33
ThrVal: 6.888 ± 2.253
ThrTrp: 0.861 ± 0.697
ThrTyr: 0.43 ± 0.281
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.583 ± 1.357
ValCys: 3.874 ± 1.38
ValAsp: 0.861 ± 0.563
ValGlu: 1.722 ± 0.601
ValPhe: 2.152 ± 1.237
ValGly: 6.027 ± 0.653
ValHis: 0.861 ± 0.563
ValIle: 0.43 ± 0.281
ValLys: 0.0 ± 0.0
ValLeu: 6.457 ± 1.569
ValMet: 0.0 ± 0.0
ValAsn: 5.166 ± 1.038
ValPro: 4.305 ± 0.804
ValGln: 2.583 ± 0.442
ValArg: 5.166 ± 1.004
ValSer: 4.735 ± 0.651
ValThr: 1.722 ± 0.579
ValVal: 3.874 ± 1.309
ValTrp: 2.152 ± 0.898
ValTyr: 1.722 ± 0.519
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.583 ± 1.295
TrpCys: 0.0 ± 0.0
TrpAsp: 0.43 ± 0.43
TrpGlu: 2.152 ± 0.602
TrpPhe: 2.152 ± 0.898
TrpGly: 4.305 ± 0.683
TrpHis: 0.0 ± 0.0
TrpIle: 1.291 ± 0.587
TrpLys: 1.291 ± 0.844
TrpLeu: 4.305 ± 0.838
TrpMet: 2.583 ± 1.295
TrpAsn: 0.861 ± 0.577
TrpPro: 0.861 ± 0.433
TrpGln: 0.43 ± 0.281
TrpArg: 0.43 ± 0.281
TrpSer: 0.861 ± 0.861
TrpThr: 1.722 ± 0.519
TrpVal: 2.583 ± 1.256
TrpTrp: 1.722 ± 0.519
TrpTyr: 1.291 ± 0.647
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.861 ± 0.563
TyrCys: 0.861 ± 0.584
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.722 ± 0.594
TyrGly: 0.861 ± 0.563
TyrHis: 0.43 ± 0.281
TyrIle: 1.722 ± 0.519
TyrLys: 2.152 ± 0.732
TyrLeu: 3.013 ± 0.472
TyrMet: 0.861 ± 0.563
TyrAsn: 0.0 ± 0.0
TyrPro: 1.722 ± 1.125
TyrGln: 1.722 ± 0.519
TyrArg: 3.013 ± 1.471
TyrSer: 3.444 ± 1.118
TyrThr: 0.861 ± 0.563
TyrVal: 2.152 ± 0.732
TyrTrp: 1.291 ± 0.647
TyrTyr: 0.43 ± 0.281
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2324 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
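One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread over the replicates is reported as the error. The sketch below follows that reading. The function names, the toy sequences, and the use of the sample standard deviation as the reported error are assumptions; the exact procedure used by Proteome-pI is not described here.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Per mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(pair_frequency(sample, pair))
    return statistics.stdev(replicates)


# Toy input in one-letter code (hypothetical, not the HBV proteome).
toy = ["MLLLA", "ALLS", "GGSLL", "PLLP"]
error = bootstrap_error(toy, "LL")
print(f"LL: {pair_frequency(toy, 'LL'):.3f} ± {error:.3f} per mille")
```

With only 8 proteins, as here, each resample can differ considerably from the original proteome, which would be consistent with the relatively large errors seen for some dipeptides.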

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski