Amino acid dipeptide frequency for Tent-making bat hepatitis B virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
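The counting and comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to produce the table: the function names and the toy sequences are hypothetical, dipeptides are counted as overlapping windows that never cross a protein boundary, and the expectation is the uniform 1/400 value over the 20 standard amino acids.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2 (assumption: windows do not
        # cross protein boundaries).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Hypothetical toy sequences, not taken from the proteome on this page.
toy = ["MLLAG", "LLS"]
freqs = dipeptide_per_mille(toy)
for dipeptide, value in sorted(freqs.items()):
    print(dipeptide, round(value, 3), classify(value))
```

With the values from the table, `classify(20.408)` (LeuLeu) gives "over" and `classify(0.729)` (AlaGlu) gives "under".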

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
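As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the LeuLeu value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


leu_leu = 20.408  # LeuLeu frequency in per mille, from the table
print(per_mille_to_fraction(leu_leu))  # roughly 0.0204
print(per_mille_to_percent(leu_leu))   # roughly 2.04
```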

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.017 ± 2.249
AlaCys: 2.187 ± 0.974
AlaAsp: 5.102 ± 1.278
AlaGlu: 0.729 ± 0.393
AlaPhe: 2.187 ± 1.179
AlaGly: 7.289 ± 3.454
AlaHis: 0.0 ± 0.0
AlaIle: 2.915 ± 0.94
AlaLys: 0.0 ± 0.0
AlaLeu: 8.017 ± 3.583
AlaMet: 0.729 ± 0.742
AlaAsn: 0.729 ± 0.393
AlaPro: 3.644 ± 0.622
AlaGln: 1.458 ± 0.946
AlaArg: 4.373 ± 1.948
AlaSer: 8.017 ± 0.885
AlaThr: 5.831 ± 1.663
AlaVal: 3.644 ± 1.33
AlaTrp: 0.729 ± 0.928
AlaTyr: 3.644 ± 1.227
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.729 ± 1.073
CysCys: 1.458 ± 1.855
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.729 ± 0.393
CysGly: 2.187 ± 1.045
CysHis: 2.187 ± 1.984
CysIle: 0.729 ± 0.928
CysLys: 0.0 ± 0.0
CysLeu: 3.644 ± 0.622
CysMet: 0.729 ± 0.71
CysAsn: 0.0 ± 0.0
CysPro: 2.187 ± 1.605
CysGln: 0.729 ± 0.393
CysArg: 2.187 ± 1.136
CysSer: 2.915 ± 0.765
CysThr: 2.915 ± 2.523
CysVal: 3.644 ± 1.408
CysTrp: 1.458 ± 0.786
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.644 ± 1.675
AspCys: 0.729 ± 0.393
AspAsp: 2.187 ± 0.757
AspGlu: 0.729 ± 0.936
AspPhe: 2.187 ± 1.654
AspGly: 0.0 ± 0.0
AspHis: 2.187 ± 0.974
AspIle: 3.644 ± 1.932
AspLys: 1.458 ± 0.786
AspLeu: 5.831 ± 2.418
AspMet: 0.0 ± 0.0
AspAsn: 0.729 ± 0.393
AspPro: 0.729 ± 1.073
AspGln: 0.729 ± 0.393
AspArg: 2.187 ± 1.179
AspSer: 1.458 ± 0.946
AspThr: 0.0 ± 0.0
AspVal: 2.187 ± 0.757
AspTrp: 2.915 ± 1.826
AspTyr: 1.458 ± 0.786
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.915 ± 0.848
GluCys: 0.0 ± 0.0
GluAsp: 2.915 ± 0.94
GluGlu: 0.729 ± 0.393
GluPhe: 0.729 ± 0.936
GluGly: 2.187 ± 1.179
GluHis: 0.729 ± 0.393
GluIle: 0.729 ± 0.928
GluLys: 1.458 ± 0.786
GluLeu: 1.458 ± 1.871
GluMet: 0.0 ± 0.0
GluAsn: 0.729 ± 0.936
GluPro: 0.0 ± 0.0
GluGln: 1.458 ± 0.755
GluArg: 2.187 ± 1.045
GluSer: 1.458 ± 0.755
GluThr: 2.187 ± 0.757
GluVal: 0.729 ± 1.073
GluTrp: 0.0 ± 0.0
GluTyr: 2.915 ± 1.001
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.102 ± 1.483
PheCys: 0.729 ± 0.393
PheAsp: 0.729 ± 0.393
PheGlu: 0.0 ± 0.0
PhePhe: 3.644 ± 1.227
PheGly: 4.373 ± 2.264
PheHis: 0.0 ± 0.0
PheIle: 3.644 ± 1.523
PheLys: 2.187 ± 1.179
PheLeu: 5.102 ± 2.134
PheMet: 1.458 ± 0.71
PheAsn: 0.0 ± 0.0
PhePro: 3.644 ± 0.622
PheGln: 1.458 ± 0.786
PheArg: 2.187 ± 1.179
PheSer: 3.644 ± 1.135
PheThr: 2.187 ± 0.974
PheVal: 1.458 ± 0.786
PheTrp: 1.458 ± 0.71
PheTyr: 0.729 ± 0.393
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.915 ± 1.572
GlyCys: 2.187 ± 1.136
GlyAsp: 0.729 ± 0.393
GlyGlu: 2.187 ± 0.676
GlyPhe: 2.187 ± 1.179
GlyGly: 3.644 ± 2.692
GlyHis: 2.187 ± 0.974
GlyIle: 3.644 ± 1.964
GlyLys: 2.915 ± 1.826
GlyLeu: 11.662 ± 2.928
GlyMet: 0.729 ± 0.393
GlyAsn: 0.729 ± 0.928
GlyPro: 8.017 ± 2.004
GlyGln: 3.644 ± 0.828
GlyArg: 2.187 ± 1.179
GlySer: 5.102 ± 2.061
GlyThr: 3.644 ± 0.951
GlyVal: 2.915 ± 1.509
GlyTrp: 2.187 ± 1.179
GlyTyr: 2.915 ± 1.826
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.729 ± 0.393
HisCys: 2.187 ± 0.757
HisAsp: 0.0 ± 0.0
HisGlu: 1.458 ± 1.871
HisPhe: 3.644 ± 1.135
HisGly: 1.458 ± 0.786
HisHis: 3.644 ± 0.622
HisIle: 1.458 ± 0.755
HisLys: 1.458 ± 0.946
HisLeu: 7.289 ± 1.762
HisMet: 0.0 ± 0.0
HisAsn: 0.729 ± 0.393
HisPro: 2.187 ± 0.974
HisGln: 1.458 ± 0.786
HisArg: 2.915 ± 1.145
HisSer: 2.187 ± 1.136
HisThr: 0.729 ± 0.936
HisVal: 2.187 ± 1.179
HisTrp: 0.0 ± 0.0
HisTyr: 1.458 ± 0.946
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.458 ± 0.786
IleCys: 2.187 ± 0.974
IleAsp: 1.458 ± 0.755
IleGlu: 0.729 ± 1.073
IlePhe: 2.187 ± 0.676
IleGly: 0.729 ± 0.928
IleHis: 1.458 ± 0.786
IleIle: 2.187 ± 0.676
IleLys: 2.187 ± 1.179
IleLeu: 4.373 ± 0.672
IleMet: 2.915 ± 0.765
IleAsn: 3.644 ± 0.828
IlePro: 3.644 ± 1.33
IleGln: 0.729 ± 0.928
IleArg: 1.458 ± 1.871
IleSer: 2.915 ± 1.846
IleThr: 2.187 ± 0.757
IleVal: 0.0 ± 0.0
IleTrp: 2.187 ± 1.045
IleTyr: 2.915 ± 0.765
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.458 ± 0.755
LysCys: 0.729 ± 0.393
LysAsp: 0.729 ± 0.393
LysGlu: 0.729 ± 0.936
LysPhe: 1.458 ± 0.786
LysGly: 2.187 ± 1.179
LysHis: 2.187 ± 0.757
LysIle: 2.187 ± 0.676
LysLys: 2.187 ± 1.179
LysLeu: 5.831 ± 2.418
LysMet: 0.0 ± 0.0
LysAsn: 0.729 ± 0.393
LysPro: 1.458 ± 0.786
LysGln: 2.187 ± 0.757
LysArg: 2.187 ± 1.179
LysSer: 1.458 ± 0.786
LysThr: 2.187 ± 0.676
LysVal: 1.458 ± 0.786
LysTrp: 0.0 ± 0.0
LysTyr: 2.915 ± 0.765
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.746 ± 1.053
LeuCys: 1.458 ± 1.434
LeuAsp: 3.644 ± 0.622
LeuGlu: 2.915 ± 1.509
LeuPhe: 2.187 ± 1.605
LeuGly: 7.289 ± 2.271
LeuHis: 4.373 ± 1.721
LeuIle: 5.102 ± 1.1
LeuLys: 2.187 ± 0.757
LeuLeu: 20.408 ± 6.012
LeuMet: 1.458 ± 0.946
LeuAsn: 2.915 ± 0.913
LeuPro: 13.12 ± 1.385
LeuGln: 7.289 ± 0.467
LeuArg: 9.475 ± 5.142
LeuSer: 12.391 ± 2.18
LeuThr: 5.831 ± 2.288
LeuVal: 8.746 ± 0.428
LeuTrp: 4.373 ± 3.21
LeuTyr: 5.831 ± 2.284
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.187 ± 1.136
MetCys: 1.458 ± 1.381
MetAsp: 1.458 ± 0.946
MetGlu: 0.729 ± 0.936
MetPhe: 0.729 ± 0.393
MetGly: 2.187 ± 1.179
MetHis: 1.458 ± 0.755
MetIle: 1.458 ± 0.946
MetLys: 0.0 ± 0.0
MetLeu: 2.187 ± 0.974
MetMet: 0.729 ± 0.928
MetAsn: 0.729 ± 0.928
MetPro: 1.458 ± 0.71
MetGln: 0.729 ± 0.393
MetArg: 0.0 ± 0.0
MetSer: 0.729 ± 0.393
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.729 ± 0.928
MetTyr: 0.729 ± 0.393
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.187 ± 1.654
AsnCys: 2.187 ± 2.159
AsnAsp: 0.729 ± 0.393
AsnGlu: 1.458 ± 0.786
AsnPhe: 0.729 ± 0.393
AsnGly: 0.0 ± 0.0
AsnHis: 2.187 ± 1.045
AsnIle: 0.729 ± 1.073
AsnLys: 0.0 ± 0.0
AsnLeu: 5.102 ± 0.957
AsnMet: 0.729 ± 1.073
AsnAsn: 0.729 ± 1.073
AsnPro: 2.187 ± 1.179
AsnGln: 2.187 ± 1.179
AsnArg: 2.187 ± 0.974
AsnSer: 1.458 ± 0.71
AsnThr: 1.458 ± 0.71
AsnVal: 0.0 ± 0.0
AsnTrp: 0.729 ± 0.393
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.017 ± 3.791
ProCys: 2.187 ± 0.974
ProAsp: 5.102 ± 2.16
ProGlu: 2.915 ± 0.94
ProPhe: 3.644 ± 1.33
ProGly: 4.373 ± 1.353
ProHis: 2.187 ± 0.757
ProIle: 5.102 ± 2.134
ProLys: 1.458 ± 0.786
ProLeu: 9.475 ± 2.042
ProMet: 2.187 ± 1.096
ProAsn: 2.187 ± 0.757
ProPro: 2.915 ± 1.145
ProGln: 2.187 ± 1.179
ProArg: 7.289 ± 2.404
ProSer: 8.017 ± 1.773
ProThr: 3.644 ± 2.066
ProVal: 5.831 ± 3.034
ProTrp: 3.644 ± 0.828
ProTyr: 0.729 ± 0.393
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.915 ± 1.145
GlnCys: 1.458 ± 0.786
GlnAsp: 1.458 ± 0.786
GlnGlu: 0.729 ± 0.928
GlnPhe: 2.187 ± 0.676
GlnGly: 1.458 ± 0.755
GlnHis: 1.458 ± 0.786
GlnIle: 0.0 ± 0.0
GlnLys: 2.187 ± 1.179
GlnLeu: 2.187 ± 0.757
GlnMet: 0.0 ± 0.909
GlnAsn: 1.458 ± 0.71
GlnPro: 1.458 ± 0.786
GlnGln: 2.187 ± 1.045
GlnArg: 2.187 ± 1.179
GlnSer: 4.373 ± 1.056
GlnThr: 2.915 ± 0.94
GlnVal: 2.187 ± 0.676
GlnTrp: 1.458 ± 0.786
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.915 ± 1.892
ArgCys: 0.0 ± 0.0
ArgAsp: 1.458 ± 1.458
ArgGlu: 2.187 ± 0.974
ArgPhe: 3.644 ± 0.828
ArgGly: 7.289 ± 2.815
ArgHis: 2.187 ± 1.187
ArgIle: 1.458 ± 0.946
ArgLys: 5.102 ± 2.75
ArgLeu: 5.831 ± 1.569
ArgMet: 0.729 ± 1.073
ArgAsn: 0.729 ± 0.393
ArgPro: 6.56 ± 2.793
ArgGln: 0.729 ± 0.393
ArgArg: 11.662 ± 7.77
ArgSer: 5.102 ± 3.142
ArgThr: 5.831 ± 1.057
ArgVal: 2.915 ± 1.572
ArgTrp: 1.458 ± 0.71
ArgTyr: 0.729 ± 0.393
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.831 ± 1.447
SerCys: 2.915 ± 0.765
SerAsp: 2.187 ± 1.187
SerGlu: 2.915 ± 1.572
SerPhe: 6.56 ± 2.15
SerGly: 5.102 ± 1.147
SerHis: 2.915 ± 0.848
SerIle: 2.187 ± 1.179
SerLys: 1.458 ± 0.755
SerLeu: 7.289 ± 1.657
SerMet: 0.729 ± 0.393
SerAsn: 4.373 ± 0.705
SerPro: 15.306 ± 4.181
SerGln: 2.187 ± 0.757
SerArg: 4.373 ± 1.056
SerSer: 7.289 ± 1.716
SerThr: 5.102 ± 0.957
SerVal: 2.915 ± 1.509
SerTrp: 2.915 ± 1.846
SerTyr: 0.729 ± 0.393
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.187 ± 1.045
ThrCys: 2.187 ± 2.783
ThrAsp: 0.729 ± 0.393
ThrGlu: 0.729 ± 0.928
ThrPhe: 3.644 ± 0.622
ThrGly: 5.831 ± 1.057
ThrHis: 1.458 ± 0.786
ThrIle: 2.187 ± 0.676
ThrLys: 3.644 ± 0.622
ThrLeu: 5.102 ± 1.414
ThrMet: 2.915 ± 1.572
ThrAsn: 1.458 ± 0.786
ThrPro: 6.56 ± 1.807
ThrGln: 1.458 ± 0.786
ThrArg: 2.915 ± 1.892
ThrSer: 6.56 ± 1.707
ThrThr: 2.915 ± 0.765
ThrVal: 2.187 ± 1.045
ThrTrp: 1.458 ± 1.458
ThrTyr: 1.458 ± 0.786
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.644 ± 0.828
ValCys: 2.187 ± 0.676
ValAsp: 2.915 ± 0.848
ValGlu: 0.729 ± 0.936
ValPhe: 0.0 ± 0.0
ValGly: 3.644 ± 0.828
ValHis: 1.458 ± 0.755
ValIle: 0.729 ± 0.936
ValLys: 0.729 ± 0.393
ValLeu: 7.289 ± 3.453
ValMet: 0.0 ± 0.0
ValAsn: 3.644 ± 1.675
ValPro: 4.373 ± 2.357
ValGln: 0.729 ± 0.393
ValArg: 3.644 ± 1.964
ValSer: 6.56 ± 2.057
ValThr: 3.644 ± 1.135
ValVal: 2.915 ± 1.572
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.187 ± 1.605
TrpCys: 0.0 ± 0.0
TrpAsp: 0.729 ± 0.393
TrpGlu: 1.458 ± 0.786
TrpPhe: 0.729 ± 0.928
TrpGly: 4.373 ± 2.131
TrpHis: 0.0 ± 0.0
TrpIle: 1.458 ± 1.381
TrpLys: 0.729 ± 0.393
TrpLeu: 4.373 ± 1.335
TrpMet: 2.187 ± 2.159
TrpAsn: 0.0 ± 0.0
TrpPro: 1.458 ± 0.786
TrpGln: 0.729 ± 0.393
TrpArg: 1.458 ± 0.71
TrpSer: 0.729 ± 0.393
TrpThr: 2.915 ± 2.093
TrpVal: 1.458 ± 0.786
TrpTrp: 3.644 ± 2.302
TrpTyr: 2.187 ± 2.159
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.458 ± 0.786
TyrCys: 0.0 ± 0.0
TyrAsp: 0.729 ± 1.073
TyrGlu: 1.458 ± 0.755
TyrPhe: 0.729 ± 0.928
TyrGly: 0.729 ± 0.393
TyrHis: 2.915 ± 0.94
TyrIle: 0.0 ± 0.0
TyrLys: 2.915 ± 0.94
TyrLeu: 7.289 ± 2.828
TyrMet: 0.729 ± 0.393
TyrAsn: 0.729 ± 0.393
TyrPro: 2.187 ± 1.179
TyrGln: 0.729 ± 0.393
TyrArg: 1.458 ± 1.381
TyrSer: 2.915 ± 1.572
TyrThr: 1.458 ± 0.786
TyrVal: 1.458 ± 0.946
TyrTrp: 1.458 ± 0.71
TyrTyr: 0.729 ± 0.393
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1373 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
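Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates serves as the error. This is a minimal sketch under stated assumptions, not the database's own code: the function names and toy sequences are hypothetical, and using the sample standard deviation as the error measure is an assumption, since the page does not say which statistic is reported.

```python
import random
import statistics


def dipeptide_frequency(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide over a set of sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):  # overlapping windows of length 2
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_frequency(sample, dipeptide))
    # Assumption: the error is the sample standard deviation.
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not the data on this page.
proteins = ["MLLAG", "LLS", "AGLL", "MKT"]
print(dipeptide_frequency(proteins, "LL"), bootstrap_error(proteins, "LL"))
```

With only 4 proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with the large errors relative to the values in the table.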

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski