Amino acid dipeptide frequency for Roundleaf bat hepatitis B virus

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all of them equally likely, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
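The counting described above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: the function names, the choice to count overlapping adjacent pairs within each protein (never across protein boundaries), and the mapping of every non-standard letter to X are all assumptions, so results may differ slightly from the tabulated values.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns (Ala, Cys, Asp, ...).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Expectation if all 400 standard dipeptides were equally likely: 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return per mille frequencies for all 441 dipeptides (20 standard letters + X).

    Assumption: overlapping adjacent pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is treated as Xaa (X).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a value the way the table colours it: red above, blue below expectation."""
    if value > expected:
        return "red"
    if value < expected:
        return "blue"
    return "neutral"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MAAC", "ACLL"])  # toy sequences, not real data
    for dipeptide in ("AA", "AC", "LL", "WW"):
        print(dipeptide, round(freqs[dipeptide], 3), classify(freqs[dipeptide]))
```

The uniform 2.5‰ baseline is used here because it is the expectation stated in the text; a baseline built from the observed single amino acid frequencies would be a different, stricter comparison.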

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
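As a quick illustration of the units, the snippet below converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the example value is the LeuLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (1 per mille = 0.1 %)."""
    return value / 10.0


if __name__ == "__main__":
    leu_leu = 22.297  # LeuLeu value from the table, in per mille
    print("fraction:", per_mille_to_fraction(leu_leu))
    print("percent: ", per_mille_to_percent(leu_leu))
```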

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.784AlaAla: 8.784 ± 5.482
2.703AlaCys: 2.703 ± 1.096
4.73AlaAsp: 4.73 ± 1.192
2.703AlaGlu: 2.703 ± 0.901
3.378AlaPhe: 3.378 ± 0.913
6.081AlaGly: 6.081 ± 1.431
1.351AlaHis: 1.351 ± 0.757
2.703AlaIle: 2.703 ± 0.738
2.027AlaLys: 2.027 ± 0.694
6.757AlaLeu: 6.757 ± 1.409
0.676AlaMet: 0.676 ± 0.379
1.351AlaAsn: 1.351 ± 0.757
3.378AlaPro: 3.378 ± 1.165
1.351AlaGln: 1.351 ± 0.712
6.081AlaArg: 6.081 ± 2.818
8.108AlaSer: 8.108 ± 2.557
2.027AlaThr: 2.027 ± 0.746
2.703AlaVal: 2.703 ± 1.096
1.351AlaTrp: 1.351 ± 0.712
2.027AlaTyr: 2.027 ± 0.746
0.0AlaXaa: 0.0 ± 0.0
Cys
3.378CysAla: 3.378 ± 1.915
1.351CysCys: 1.351 ± 1.811
0.0CysAsp: 0.0 ± 0.0
0.676CysGlu: 0.676 ± 1.05
0.676CysPhe: 0.676 ± 0.906
2.027CysGly: 2.027 ± 1.136
0.0CysHis: 0.0 ± 0.0
0.676CysIle: 0.676 ± 0.379
0.676CysLys: 0.676 ± 0.906
5.405CysLeu: 5.405 ± 2.511
0.676CysMet: 0.676 ± 0.808
0.0CysAsn: 0.0 ± 0.0
1.351CysPro: 1.351 ± 1.811
0.676CysGln: 0.676 ± 0.379
0.676CysArg: 0.676 ± 1.05
4.054CysSer: 4.054 ± 1.325
4.054CysThr: 4.054 ± 2.137
1.351CysVal: 1.351 ± 0.922
2.027CysTrp: 2.027 ± 1.015
0.676CysTyr: 0.676 ± 0.379
0.0CysXaa: 0.0 ± 0.0
Asp
4.054AspAla: 4.054 ± 0.682
0.0AspCys: 0.0 ± 0.0
1.351AspAsp: 1.351 ± 0.757
0.0AspGlu: 0.0 ± 0.0
1.351AspPhe: 1.351 ± 0.766
0.0AspGly: 0.0 ± 0.0
0.676AspHis: 0.676 ± 0.379
0.676AspIle: 0.676 ± 0.95
1.351AspLys: 1.351 ± 0.757
4.054AspLeu: 4.054 ± 1.878
0.676AspMet: 0.676 ± 0.95
2.027AspAsn: 2.027 ± 1.136
1.351AspPro: 1.351 ± 1.427
0.676AspGln: 0.676 ± 0.379
0.676AspArg: 0.676 ± 0.379
1.351AspSer: 1.351 ± 0.712
2.027AspThr: 2.027 ± 1.136
2.703AspVal: 2.703 ± 1.096
2.703AspTrp: 2.703 ± 0.985
0.676AspTyr: 0.676 ± 1.05
0.0AspXaa: 0.0 ± 0.0
Glu
2.027GluAla: 2.027 ± 1.136
0.676GluCys: 0.676 ± 0.906
1.351GluAsp: 1.351 ± 0.766
3.378GluGlu: 3.378 ± 1.646
2.703GluPhe: 2.703 ± 1.531
2.703GluGly: 2.703 ± 0.901
2.703GluHis: 2.703 ± 1.531
0.0GluIle: 0.0 ± 0.0
0.676GluLys: 0.676 ± 0.379
6.081GluLeu: 6.081 ± 2.424
0.676GluMet: 0.676 ± 0.379
0.0GluAsn: 0.0 ± 0.0
0.676GluPro: 0.676 ± 1.05
1.351GluGln: 1.351 ± 0.757
1.351GluArg: 1.351 ± 0.922
1.351GluSer: 1.351 ± 0.757
2.703GluThr: 2.703 ± 1.897
0.676GluVal: 0.676 ± 0.379
0.0GluTrp: 0.0 ± 0.0
0.676GluTyr: 0.676 ± 0.379
0.0GluXaa: 0.0 ± 0.0
Phe
4.73PheAla: 4.73 ± 1.518
1.351PheCys: 1.351 ± 0.766
0.0PheAsp: 0.0 ± 0.0
0.676PheGlu: 0.676 ± 0.379
3.378PhePhe: 3.378 ± 1.474
4.054PheGly: 4.054 ± 2.468
2.703PheHis: 2.703 ± 1.897
1.351PheIle: 1.351 ± 0.712
1.351PheLys: 1.351 ± 0.757
8.108PheLeu: 8.108 ± 2.671
1.351PheMet: 1.351 ± 0.712
1.351PheAsn: 1.351 ± 0.757
6.757PhePro: 6.757 ± 0.617
0.676PheGln: 0.676 ± 0.906
3.378PheArg: 3.378 ± 1.136
6.081PheSer: 6.081 ± 1.776
2.703PheThr: 2.703 ± 1.096
1.351PheVal: 1.351 ± 1.45
0.676PheTrp: 0.676 ± 0.379
0.676PheTyr: 0.676 ± 0.379
0.0PheXaa: 0.0 ± 0.0
Gly
6.081GlyAla: 6.081 ± 1.749
1.351GlyCys: 1.351 ± 0.922
0.676GlyAsp: 0.676 ± 0.379
3.378GlyGlu: 3.378 ± 0.868
4.054GlyPhe: 4.054 ± 1.388
5.405GlyGly: 5.405 ± 1.026
1.351GlyHis: 1.351 ± 0.922
2.703GlyIle: 2.703 ± 0.862
2.703GlyLys: 2.703 ± 0.901
10.135GlyLeu: 10.135 ± 1.918
0.0GlyMet: 0.0 ± 0.0
2.027GlyAsn: 2.027 ± 1.585
6.081GlyPro: 6.081 ± 0.757
1.351GlyGln: 1.351 ± 0.766
4.73GlyArg: 4.73 ± 1.97
2.703GlySer: 2.703 ± 0.901
3.378GlyThr: 3.378 ± 1.165
4.054GlyVal: 4.054 ± 1.643
1.351GlyTrp: 1.351 ± 0.766
1.351GlyTyr: 1.351 ± 0.712
0.0GlyXaa: 0.0 ± 0.0
His
0.676HisAla: 0.676 ± 0.379
3.378HisCys: 3.378 ± 0.868
0.676HisAsp: 0.676 ± 0.379
0.676HisGlu: 0.676 ± 0.379
3.378HisPhe: 3.378 ± 1.345
1.351HisGly: 1.351 ± 0.757
2.027HisHis: 2.027 ± 0.746
1.351HisIle: 1.351 ± 0.757
2.027HisLys: 2.027 ± 0.939
7.432HisLeu: 7.432 ± 2.635
0.676HisMet: 0.676 ± 1.05
2.027HisAsn: 2.027 ± 1.136
1.351HisPro: 1.351 ± 0.757
2.027HisGln: 2.027 ± 1.136
2.027HisArg: 2.027 ± 0.939
3.378HisSer: 3.378 ± 1.136
2.027HisThr: 2.027 ± 1.684
0.0HisVal: 0.0 ± 0.0
0.676HisTrp: 0.676 ± 0.379
0.676HisTyr: 0.676 ± 0.379
0.0HisXaa: 0.0 ± 0.0
Ile
1.351IleAla: 1.351 ± 0.712
0.0IleCys: 0.0 ± 0.0
0.676IleAsp: 0.676 ± 0.95
1.351IleGlu: 1.351 ± 0.766
1.351IlePhe: 1.351 ± 1.343
0.0IleGly: 0.0 ± 0.0
1.351IleHis: 1.351 ± 0.757
0.676IleIle: 0.676 ± 0.906
1.351IleLys: 1.351 ± 0.922
4.054IleLeu: 4.054 ± 0.658
1.351IleMet: 1.351 ± 0.646
0.676IleAsn: 0.676 ± 0.379
3.378IlePro: 3.378 ± 1.354
2.027IleGln: 2.027 ± 0.746
2.027IleArg: 2.027 ± 0.746
2.703IleSer: 2.703 ± 1.765
0.676IleThr: 0.676 ± 0.95
2.027IleVal: 2.027 ± 1.684
1.351IleTrp: 1.351 ± 0.712
0.676IleTyr: 0.676 ± 0.906
0.0IleXaa: 0.0 ± 0.0
Lys
2.027LysAla: 2.027 ± 1.136
0.0LysCys: 0.0 ± 0.0
0.676LysAsp: 0.676 ± 0.379
1.351LysGlu: 1.351 ± 1.9
2.027LysPhe: 2.027 ± 0.694
3.378LysGly: 3.378 ± 1.893
2.027LysHis: 2.027 ± 0.694
1.351LysIle: 1.351 ± 0.712
0.0LysLys: 0.0 ± 0.0
2.027LysLeu: 2.027 ± 1.149
0.0LysMet: 0.0 ± 0.955
1.351LysAsn: 1.351 ± 0.757
2.703LysPro: 2.703 ± 0.862
3.378LysGln: 3.378 ± 1.345
1.351LysArg: 1.351 ± 0.757
0.676LysSer: 0.676 ± 0.379
2.703LysThr: 2.703 ± 1.515
0.0LysVal: 0.0 ± 0.0
0.676LysTrp: 0.676 ± 0.379
0.676LysTyr: 0.676 ± 0.379
0.0LysXaa: 0.0 ± 0.0
Leu
8.108LeuAla: 8.108 ± 1.9
4.73LeuCys: 4.73 ± 1.175
5.405LeuAsp: 5.405 ± 1.97
1.351LeuGlu: 1.351 ± 0.757
5.405LeuPhe: 5.405 ± 1.034
12.162LeuGly: 12.162 ± 3.297
4.054LeuHis: 4.054 ± 1.643
5.405LeuIle: 5.405 ± 3.161
2.027LeuLys: 2.027 ± 0.939
22.297LeuLeu: 22.297 ± 6.009
2.027LeuMet: 2.027 ± 1.149
6.757LeuAsn: 6.757 ± 3.509
9.459LeuPro: 9.459 ± 1.247
5.405LeuGln: 5.405 ± 1.334
8.784LeuArg: 8.784 ± 2.327
8.784LeuSer: 8.784 ± 1.532
6.757LeuThr: 6.757 ± 1.328
8.108LeuVal: 8.108 ± 3.127
3.378LeuTrp: 3.378 ± 2.616
3.378LeuTyr: 3.378 ± 1.165
0.0LeuXaa: 0.0 ± 0.0
Met
0.676MetAla: 0.676 ± 1.05
0.676MetCys: 0.676 ± 0.906
1.351MetAsp: 1.351 ± 0.766
0.676MetGlu: 0.676 ± 1.05
0.0MetPhe: 0.0 ± 0.0
2.703MetGly: 2.703 ± 0.862
1.351MetHis: 1.351 ± 0.757
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.676MetLeu: 0.676 ± 0.379
0.676MetMet: 0.676 ± 0.906
0.676MetAsn: 0.676 ± 1.05
2.027MetPro: 2.027 ± 0.694
0.0MetGln: 0.0 ± 0.0
1.351MetArg: 1.351 ± 1.343
0.0MetSer: 0.0 ± 0.0
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.351MetTrp: 1.351 ± 1.343
0.676MetTyr: 0.676 ± 0.95
0.0MetXaa: 0.0 ± 0.0
Asn
0.676AsnAla: 0.676 ± 0.95
2.027AsnCys: 2.027 ± 2.143
0.0AsnAsp: 0.0 ± 0.0
0.0AsnGlu: 0.0 ± 0.0
0.676AsnPhe: 0.676 ± 1.05
1.351AsnGly: 1.351 ± 0.757
2.703AsnHis: 2.703 ± 1.515
0.676AsnIle: 0.676 ± 0.906
0.676AsnLys: 0.676 ± 0.379
5.405AsnLeu: 5.405 ± 1.212
0.0AsnMet: 0.0 ± 0.0
1.351AsnAsn: 1.351 ± 0.712
3.378AsnPro: 3.378 ± 1.136
1.351AsnGln: 1.351 ± 1.45
1.351AsnArg: 1.351 ± 0.757
4.054AsnSer: 4.054 ± 0.658
2.703AsnThr: 2.703 ± 0.985
0.0AsnVal: 0.0 ± 0.0
0.676AsnTrp: 0.676 ± 0.379
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
6.757ProAla: 6.757 ± 2.086
2.027ProCys: 2.027 ± 0.939
3.378ProAsp: 3.378 ± 1.893
4.054ProGlu: 4.054 ± 1.325
5.405ProPhe: 5.405 ± 1.71
3.378ProGly: 3.378 ± 1.354
2.703ProHis: 2.703 ± 0.901
2.027ProIle: 2.027 ± 1.015
2.027ProLys: 2.027 ± 1.136
10.811ProLeu: 10.811 ± 1.478
1.351ProMet: 1.351 ± 0.688
4.73ProAsn: 4.73 ± 1.175
8.108ProPro: 8.108 ± 2.151
3.378ProGln: 3.378 ± 1.165
8.108ProArg: 8.108 ± 3.855
6.081ProSer: 6.081 ± 1.128
5.405ProThr: 5.405 ± 2.43
2.703ProVal: 2.703 ± 0.985
2.027ProTrp: 2.027 ± 0.694
3.378ProTyr: 3.378 ± 0.588
0.0ProXaa: 0.0 ± 0.0
Gln
4.054GlnAla: 4.054 ± 2.297
0.676GlnCys: 0.676 ± 0.379
0.676GlnAsp: 0.676 ± 0.95
0.0GlnGlu: 0.0 ± 0.0
1.351GlnPhe: 1.351 ± 0.757
1.351GlnGly: 1.351 ± 0.922
1.351GlnHis: 1.351 ± 0.757
0.0GlnIle: 0.0 ± 0.0
0.676GlnLys: 0.676 ± 0.379
4.054GlnLeu: 4.054 ± 1.491
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
2.703GlnPro: 2.703 ± 0.901
0.676GlnGln: 0.676 ± 0.95
4.054GlnArg: 4.054 ± 1.878
7.432GlnSer: 7.432 ± 0.721
3.378GlnThr: 3.378 ± 1.354
2.027GlnVal: 2.027 ± 1.015
1.351GlnTrp: 1.351 ± 1.45
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.378ArgAla: 3.378 ± 0.868
0.676ArgCys: 0.676 ± 0.906
2.027ArgAsp: 2.027 ± 1.94
2.027ArgGlu: 2.027 ± 1.136
4.054ArgPhe: 4.054 ± 1.451
5.405ArgGly: 5.405 ± 1.444
2.027ArgHis: 2.027 ± 1.94
2.027ArgIle: 2.027 ± 1.136
1.351ArgLys: 1.351 ± 0.757
8.108ArgLeu: 8.108 ± 2.622
0.676ArgMet: 0.676 ± 0.95
0.676ArgAsn: 0.676 ± 0.95
8.108ArgPro: 8.108 ± 2.622
3.378ArgGln: 3.378 ± 0.868
13.514ArgArg: 13.514 ± 7.682
6.757ArgSer: 6.757 ± 2.839
8.108ArgThr: 8.108 ± 2.128
3.378ArgVal: 3.378 ± 1.893
2.027ArgTrp: 2.027 ± 0.694
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.73SerAla: 4.73 ± 1.192
2.703SerCys: 2.703 ± 1.765
2.027SerAsp: 2.027 ± 0.939
2.703SerGlu: 2.703 ± 0.862
6.081SerPhe: 6.081 ± 1.28
3.378SerGly: 3.378 ± 1.136
3.378SerHis: 3.378 ± 1.893
2.027SerIle: 2.027 ± 1.136
2.703SerLys: 2.703 ± 0.738
8.784SerLeu: 8.784 ± 2.11
0.0SerMet: 0.0 ± 0.0
1.351SerAsn: 1.351 ± 0.766
14.865SerPro: 14.865 ± 2.811
3.378SerGln: 3.378 ± 2.738
8.108SerArg: 8.108 ± 3.632
8.108SerSer: 8.108 ± 0.392
4.73SerThr: 4.73 ± 1.817
4.054SerVal: 4.054 ± 1.491
2.027SerTrp: 2.027 ± 2.717
3.378SerTyr: 3.378 ± 1.165
0.0SerXaa: 0.0 ± 0.0
Thr
4.73ThrAla: 4.73 ± 1.175
2.703ThrCys: 2.703 ± 3.004
0.0ThrAsp: 0.0 ± 0.0
0.676ThrGlu: 0.676 ± 0.379
2.703ThrPhe: 2.703 ± 0.901
2.703ThrGly: 2.703 ± 1.515
4.054ThrHis: 4.054 ± 2.272
2.703ThrIle: 2.703 ± 1.765
3.378ThrLys: 3.378 ± 1.136
3.378ThrLeu: 3.378 ± 1.031
0.676ThrMet: 0.676 ± 0.906
2.027ThrAsn: 2.027 ± 1.136
5.405ThrPro: 5.405 ± 1.33
1.351ThrGln: 1.351 ± 0.757
3.378ThrArg: 3.378 ± 1.165
9.459ThrSer: 9.459 ± 3.036
4.054ThrThr: 4.054 ± 1.325
2.027ThrVal: 2.027 ± 2.85
2.703ThrTrp: 2.703 ± 1.897
2.703ThrTyr: 2.703 ± 1.096
0.0ThrXaa: 0.0 ± 0.0
Val
2.027ValAla: 2.027 ± 1.136
2.703ValCys: 2.703 ± 0.862
2.027ValAsp: 2.027 ± 1.136
2.703ValGlu: 2.703 ± 1.897
2.703ValPhe: 2.703 ± 0.862
3.378ValGly: 3.378 ± 1.554
1.351ValHis: 1.351 ± 0.757
0.676ValIle: 0.676 ± 0.95
0.0ValLys: 0.0 ± 0.0
7.432ValLeu: 7.432 ± 3.117
0.0ValMet: 0.0 ± 0.0
0.676ValAsn: 0.676 ± 0.379
2.703ValPro: 2.703 ± 0.985
2.027ValGln: 2.027 ± 2.85
4.054ValArg: 4.054 ± 2.517
4.054ValSer: 4.054 ± 1.479
1.351ValThr: 1.351 ± 0.922
4.054ValVal: 4.054 ± 0.992
0.0ValTrp: 0.0 ± 0.0
0.676ValTyr: 0.676 ± 0.379
0.0ValXaa: 0.0 ± 0.0
Trp
0.676TrpAla: 0.676 ± 0.906
0.0TrpCys: 0.0 ± 0.0
1.351TrpAsp: 1.351 ± 1.427
2.703TrpGlu: 2.703 ± 0.985
1.351TrpPhe: 1.351 ± 1.343
4.054TrpGly: 4.054 ± 1.232
0.0TrpHis: 0.0 ± 0.0
0.676TrpIle: 0.676 ± 0.95
1.351TrpLys: 1.351 ± 0.757
4.73TrpLeu: 4.73 ± 2.081
2.027TrpMet: 2.027 ± 1.585
0.0TrpAsn: 0.0 ± 0.0
1.351TrpPro: 1.351 ± 0.712
0.676TrpGln: 0.676 ± 1.05
1.351TrpArg: 1.351 ± 0.712
1.351TrpSer: 1.351 ± 0.757
1.351TrpThr: 1.351 ± 1.811
1.351TrpVal: 1.351 ± 0.766
1.351TrpTrp: 1.351 ± 0.712
1.351TrpTyr: 1.351 ± 1.45
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.351TyrAla: 1.351 ± 0.757
0.676TyrCys: 0.676 ± 0.906
0.0TyrAsp: 0.0 ± 0.0
0.676TyrGlu: 0.676 ± 0.95
0.676TyrPhe: 0.676 ± 0.379
0.0TyrGly: 0.0 ± 0.0
0.676TyrHis: 0.676 ± 0.379
1.351TyrIle: 1.351 ± 1.45
2.703TyrLys: 2.703 ± 0.901
4.054TyrLeu: 4.054 ± 0.658
0.676TyrMet: 0.676 ± 0.379
0.0TyrAsn: 0.0 ± 0.0
2.703TyrPro: 2.703 ± 0.901
0.676TyrGln: 0.676 ± 0.379
1.351TyrArg: 1.351 ± 0.766
2.027TyrSer: 2.027 ± 1.136
0.676TyrThr: 0.676 ± 0.379
2.027TyrVal: 2.027 ± 0.939
1.351TyrTrp: 1.351 ± 1.45
0.676TyrTyr: 0.676 ± 0.379
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1481 amino acids)
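Each table entry above follows the same pattern: the cell value, the dipeptide name, then the value again with its error (for example "22.297LeuLeu: 22.297 ± 6.009"). A minimal parsing sketch follows, assuming exactly this layout; the `DipeptideEntry` type and `parse_row` function are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple, Optional


class DipeptideEntry(NamedTuple):
    first: str    # three-letter code of the first residue, e.g. "Leu"
    second: str   # three-letter code of the second residue
    value: float  # frequency in per mille
    error: float  # error estimate in per mille


# Assumed layout: <cell value><First><Second>: <value> ± <error>
ROW = re.compile(
    r"^(?P<cell>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_row(line: str) -> Optional[DipeptideEntry]:
    """Parse one table entry; return None for row headers such as 'Ala'."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    return DipeptideEntry(
        match["first"],
        match["second"],
        float(match["value"]),
        float(match["error"]),
    )


if __name__ == "__main__":
    print(parse_row("22.297LeuLeu: 22.297 ± 6.009"))
    print(parse_row("Leu"))  # row header, not a data entry
```

Lines that do not match, such as the single-residue row headers, are returned as `None` so that a caller can skip them while iterating over the table.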

Note: The error was estimated by bootstrapping (×100) at the protein level.
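A protein-level bootstrap of this kind could look like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the standard deviation of the replicate values is reported as the error. The source does not specify which spread statistic is used, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per mille frequency of each observed dipeptide (overlapping pairs, per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dipeptide: 1000.0 * n / total for dipeptide, n in counts.items()}


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling whole proteins.

    Assumption: the error is the standard deviation over the bootstrap replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_per_mille(sample).get(dipeptide, 0.0))
    return statistics.pstdev(values)


if __name__ == "__main__":
    toy_proteome = ["MLLAG", "SPRRL", "LLSPA", "GLPLL"]  # toy sequences, not real data
    print("LL error (per mille):", round(bootstrap_error(toy_proteome, "LL"), 3))
```

With only a handful of proteins, as in this proteome, each replicate can differ strongly from the original set, which is one reason the reported errors can be large relative to the values.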

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski