Amino acid dipepetide frequency for Bluegill hepatitis B virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.977AlaAla: 5.977 ± 2.515
1.195AlaCys: 1.195 ± 0.628
2.391AlaAsp: 2.391 ± 0.839
0.598AlaGlu: 0.598 ± 0.581
4.782AlaPhe: 4.782 ± 0.695
1.793AlaGly: 1.793 ± 1.308
4.184AlaHis: 4.184 ± 0.904
2.989AlaIle: 2.989 ± 1.132
2.391AlaLys: 2.391 ± 0.773
4.782AlaLeu: 4.782 ± 0.644
1.195AlaMet: 1.195 ± 0.642
3.586AlaAsn: 3.586 ± 1.124
5.38AlaPro: 5.38 ± 0.86
4.782AlaGln: 4.782 ± 1.995
5.977AlaArg: 5.977 ± 2.656
3.586AlaSer: 3.586 ± 0.565
7.173AlaThr: 7.173 ± 1.176
3.586AlaVal: 3.586 ± 0.785
0.0AlaTrp: 0.0 ± 0.0
4.184AlaTyr: 4.184 ± 1.462
0.0AlaXaa: 0.0 ± 0.0
Cys
0.598CysAla: 0.598 ± 0.581
3.586CysCys: 3.586 ± 2.732
0.598CysAsp: 0.598 ± 0.581
0.0CysGlu: 0.0 ± 0.0
0.598CysPhe: 0.598 ± 0.338
0.598CysGly: 0.598 ± 0.338
1.195CysHis: 1.195 ± 0.616
2.391CysIle: 2.391 ± 1.003
0.598CysLys: 0.598 ± 0.581
1.793CysLeu: 1.793 ± 1.345
0.598CysMet: 0.598 ± 0.338
0.598CysAsn: 0.598 ± 0.581
0.598CysPro: 0.598 ± 0.581
2.989CysGln: 2.989 ± 2.068
0.0CysArg: 0.0 ± 0.0
2.989CysSer: 2.989 ± 1.004
1.195CysThr: 1.195 ± 1.163
1.195CysVal: 1.195 ± 0.89
0.0CysTrp: 0.0 ± 0.0
1.195CysTyr: 1.195 ± 1.021
0.0CysXaa: 0.0 ± 0.0
Asp
4.184AspAla: 4.184 ± 0.546
1.195AspCys: 1.195 ± 1.163
2.391AspAsp: 2.391 ± 1.351
0.0AspGlu: 0.0 ± 0.0
4.782AspPhe: 4.782 ± 2.782
0.598AspGly: 0.598 ± 0.338
0.598AspHis: 0.598 ± 0.338
2.391AspIle: 2.391 ± 0.742
2.989AspLys: 2.989 ± 1.008
1.793AspLeu: 1.793 ± 0.731
0.0AspMet: 0.0 ± 0.0
1.793AspAsn: 1.793 ± 1.013
2.989AspPro: 2.989 ± 0.566
3.586AspGln: 3.586 ± 1.719
2.989AspArg: 2.989 ± 0.566
2.391AspSer: 2.391 ± 0.742
0.598AspThr: 0.598 ± 0.338
1.195AspVal: 1.195 ± 0.675
0.598AspTrp: 0.598 ± 0.581
0.598AspTyr: 0.598 ± 0.338
0.0AspXaa: 0.0 ± 0.0
Glu
2.391GluAla: 2.391 ± 0.999
0.598GluCys: 0.598 ± 0.581
0.598GluAsp: 0.598 ± 0.338
0.598GluGlu: 0.598 ± 0.338
1.195GluPhe: 1.195 ± 1.021
1.195GluGly: 1.195 ± 1.021
1.195GluHis: 1.195 ± 0.675
2.391GluIle: 2.391 ± 1.234
1.793GluLys: 1.793 ± 0.595
2.989GluLeu: 2.989 ± 1.004
0.598GluMet: 0.598 ± 0.585
0.598GluAsn: 0.598 ± 0.338
1.195GluPro: 1.195 ± 0.628
1.195GluGln: 1.195 ± 0.675
0.598GluArg: 0.598 ± 0.338
1.793GluSer: 1.793 ± 1.004
0.0GluThr: 0.0 ± 0.0
2.391GluVal: 2.391 ± 1.351
0.598GluTrp: 0.598 ± 0.338
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.184PheAla: 4.184 ± 1.176
1.195PheCys: 1.195 ± 0.481
0.0PheAsp: 0.0 ± 0.0
0.0PheGlu: 0.0 ± 0.0
4.782PhePhe: 4.782 ± 0.639
4.782PheGly: 4.782 ± 2.361
2.391PheHis: 2.391 ± 1.134
1.793PheIle: 1.793 ± 1.004
1.195PheLys: 1.195 ± 0.481
7.77PheLeu: 7.77 ± 1.629
1.793PheMet: 1.793 ± 1.004
0.598PheAsn: 0.598 ± 0.338
3.586PhePro: 3.586 ± 0.565
0.598PheGln: 0.598 ± 0.338
1.793PheArg: 1.793 ± 0.746
5.38PheSer: 5.38 ± 1.388
1.793PheThr: 1.793 ± 0.595
1.793PheVal: 1.793 ± 1.013
4.184PheTrp: 4.184 ± 2.076
1.793PheTyr: 1.793 ± 0.862
0.0PheXaa: 0.0 ± 0.0
Gly
4.184GlyAla: 4.184 ± 0.976
1.195GlyCys: 1.195 ± 0.481
2.989GlyAsp: 2.989 ± 1.893
0.598GlyGlu: 0.598 ± 0.338
3.586GlyPhe: 3.586 ± 1.124
4.184GlyGly: 4.184 ± 0.546
1.793GlyHis: 1.793 ± 1.013
7.77GlyIle: 7.77 ± 1.432
3.586GlyLys: 3.586 ± 0.721
5.977GlyLeu: 5.977 ± 1.827
1.195GlyMet: 1.195 ± 0.971
1.793GlyAsn: 1.793 ± 0.898
2.989GlyPro: 2.989 ± 0.941
0.598GlyGln: 0.598 ± 0.338
4.184GlyArg: 4.184 ± 1.483
7.173GlySer: 7.173 ± 1.442
3.586GlyThr: 3.586 ± 1.088
2.391GlyVal: 2.391 ± 0.958
1.195GlyTrp: 1.195 ± 0.481
1.793GlyTyr: 1.793 ± 0.595
0.0GlyXaa: 0.0 ± 0.0
His
3.586HisAla: 3.586 ± 1.358
0.0HisCys: 0.0 ± 0.0
0.598HisAsp: 0.598 ± 0.673
1.793HisGlu: 1.793 ± 0.595
1.793HisPhe: 1.793 ± 0.81
1.195HisGly: 1.195 ± 0.628
4.184HisHis: 4.184 ± 1.758
3.586HisIle: 3.586 ± 2.026
0.598HisLys: 0.598 ± 0.338
5.38HisLeu: 5.38 ± 1.465
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
2.391HisPro: 2.391 ± 0.839
1.195HisGln: 1.195 ± 0.481
1.793HisArg: 1.793 ± 1.013
8.966HisSer: 8.966 ± 1.459
0.598HisThr: 0.598 ± 0.338
4.782HisVal: 4.782 ± 1.566
1.195HisTrp: 1.195 ± 1.021
0.598HisTyr: 0.598 ± 0.338
0.0HisXaa: 0.0 ± 0.0
Ile
2.989IleAla: 2.989 ± 1.004
0.0IleCys: 0.0 ± 0.0
1.793IleAsp: 1.793 ± 1.004
0.598IleGlu: 0.598 ± 0.679
4.782IlePhe: 4.782 ± 2.007
2.989IleGly: 2.989 ± 1.028
3.586IleHis: 3.586 ± 1.445
4.782IleIle: 4.782 ± 1.317
1.195IleLys: 1.195 ± 0.675
9.564IleLeu: 9.564 ± 1.706
0.0IleMet: 0.0 ± 0.0
2.989IleAsn: 2.989 ± 1.893
2.989IlePro: 2.989 ± 1.132
2.989IleGln: 2.989 ± 0.566
1.195IleArg: 1.195 ± 1.021
3.586IleSer: 3.586 ± 1.62
4.184IleThr: 4.184 ± 1.541
4.184IleVal: 4.184 ± 1.758
1.793IleTrp: 1.793 ± 1.013
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.989LysAla: 2.989 ± 1.008
1.195LysCys: 1.195 ± 0.971
0.598LysAsp: 0.598 ± 0.581
1.195LysGlu: 1.195 ± 0.616
0.598LysPhe: 0.598 ± 0.338
2.989LysGly: 2.989 ± 0.898
1.793LysHis: 1.793 ± 0.706
3.586LysIle: 3.586 ± 0.565
1.793LysLys: 1.793 ± 0.706
7.173LysLeu: 7.173 ± 1.49
1.195LysMet: 1.195 ± 0.813
2.391LysAsn: 2.391 ± 0.773
1.793LysPro: 1.793 ± 0.81
0.0LysGln: 0.0 ± 0.0
1.195LysArg: 1.195 ± 0.481
2.989LysSer: 2.989 ± 0.898
4.184LysThr: 4.184 ± 0.546
1.793LysVal: 1.793 ± 0.595
1.195LysTrp: 1.195 ± 0.675
1.793LysTyr: 1.793 ± 0.731
0.0LysXaa: 0.0 ± 0.0
Leu
10.161LeuAla: 10.161 ± 3.294
4.184LeuCys: 4.184 ± 2.471
2.391LeuAsp: 2.391 ± 0.963
4.782LeuGlu: 4.782 ± 1.422
3.586LeuPhe: 3.586 ± 1.503
7.77LeuGly: 7.77 ± 1.944
2.989LeuHis: 2.989 ± 1.132
4.782LeuIle: 4.782 ± 1.201
1.195LeuLys: 1.195 ± 0.616
16.736LeuLeu: 16.736 ± 2.943
0.0LeuMet: 0.0 ± 0.0
2.989LeuAsn: 2.989 ± 0.898
5.977LeuPro: 5.977 ± 1.001
6.575LeuGln: 6.575 ± 0.742
8.966LeuArg: 8.966 ± 1.459
7.77LeuSer: 7.77 ± 1.104
7.173LeuThr: 7.173 ± 1.217
3.586LeuVal: 3.586 ± 1.189
2.989LeuTrp: 2.989 ± 1.663
4.782LeuTyr: 4.782 ± 2.187
0.0LeuXaa: 0.0 ± 0.0
Met
1.793MetAla: 1.793 ± 1.67
0.0MetCys: 0.0 ± 0.0
0.598MetAsp: 0.598 ± 0.338
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.793MetGly: 1.793 ± 0.706
0.598MetHis: 0.598 ± 0.338
0.0MetIle: 0.0 ± 0.0
0.598MetLys: 0.598 ± 0.679
2.989MetLeu: 2.989 ± 1.028
0.598MetMet: 0.598 ± 0.581
0.0MetAsn: 0.0 ± 0.0
2.391MetPro: 2.391 ± 0.893
1.793MetGln: 1.793 ± 0.898
0.598MetArg: 0.598 ± 0.673
0.598MetSer: 0.598 ± 0.338
2.989MetThr: 2.989 ± 0.566
1.195MetVal: 1.195 ± 0.628
0.598MetTrp: 0.598 ± 0.581
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.195AsnAla: 1.195 ± 1.021
2.989AsnCys: 2.989 ± 1.722
0.598AsnAsp: 0.598 ± 0.581
1.195AsnGlu: 1.195 ± 1.021
0.598AsnPhe: 0.598 ± 0.679
2.391AsnGly: 2.391 ± 0.963
0.0AsnHis: 0.0 ± 0.0
0.598AsnIle: 0.598 ± 0.581
2.391AsnLys: 2.391 ± 0.839
2.989AsnLeu: 2.989 ± 1.016
0.598AsnMet: 0.598 ± 0.338
1.195AsnAsn: 1.195 ± 0.675
3.586AsnPro: 3.586 ± 2.026
0.598AsnGln: 0.598 ± 0.338
5.38AsnArg: 5.38 ± 1.525
1.793AsnSer: 1.793 ± 0.862
1.195AsnThr: 1.195 ± 0.481
0.598AsnVal: 0.598 ± 0.338
0.0AsnTrp: 0.0 ± 0.0
1.793AsnTyr: 1.793 ± 0.898
0.0AsnXaa: 0.0 ± 0.0
Pro
4.184ProAla: 4.184 ± 1.462
0.598ProCys: 0.598 ± 0.338
1.195ProAsp: 1.195 ± 1.021
2.989ProGlu: 2.989 ± 0.566
2.989ProPhe: 2.989 ± 1.008
4.184ProGly: 4.184 ± 1.491
3.586ProHis: 3.586 ± 1.445
3.586ProIle: 3.586 ± 1.204
2.391ProLys: 2.391 ± 1.416
3.586ProLeu: 3.586 ± 1.124
1.195ProMet: 1.195 ± 0.675
4.782ProAsn: 4.782 ± 1.785
10.161ProPro: 10.161 ± 4.828
2.989ProGln: 2.989 ± 1.475
3.586ProArg: 3.586 ± 1.209
4.782ProSer: 4.782 ± 1.0
6.575ProThr: 6.575 ± 1.91
2.989ProVal: 2.989 ± 1.475
1.195ProTrp: 1.195 ± 0.481
1.793ProTyr: 1.793 ± 1.013
0.0ProXaa: 0.0 ± 0.0
Gln
1.793GlnAla: 1.793 ± 1.013
1.195GlnCys: 1.195 ± 0.971
2.989GlnAsp: 2.989 ± 1.008
0.598GlnGlu: 0.598 ± 0.338
1.793GlnPhe: 1.793 ± 0.746
1.793GlnGly: 1.793 ± 1.013
2.391GlnHis: 2.391 ± 0.773
2.989GlnIle: 2.989 ± 1.016
4.184GlnLys: 4.184 ± 2.907
4.782GlnLeu: 4.782 ± 1.683
1.793GlnMet: 1.793 ± 0.595
0.598GlnAsn: 0.598 ± 0.581
2.391GlnPro: 2.391 ± 0.963
2.391GlnGln: 2.391 ± 1.295
3.586GlnArg: 3.586 ± 2.026
1.793GlnSer: 1.793 ± 0.595
2.989GlnThr: 2.989 ± 1.635
2.989GlnVal: 2.989 ± 0.985
0.0GlnTrp: 0.0 ± 0.0
1.195GlnTyr: 1.195 ± 1.021
0.0GlnXaa: 0.0 ± 0.0
Arg
1.793ArgAla: 1.793 ± 0.746
1.195ArgCys: 1.195 ± 0.628
1.195ArgAsp: 1.195 ± 1.021
2.989ArgGlu: 2.989 ± 1.337
1.195ArgPhe: 1.195 ± 0.481
5.977ArgGly: 5.977 ± 2.885
4.184ArgHis: 4.184 ± 1.853
4.184ArgIle: 4.184 ± 0.654
2.391ArgLys: 2.391 ± 0.839
4.782ArgLeu: 4.782 ± 1.566
2.391ArgMet: 2.391 ± 0.878
1.195ArgAsn: 1.195 ± 0.616
1.793ArgPro: 1.793 ± 0.898
3.586ArgGln: 3.586 ± 1.719
7.173ArgArg: 7.173 ± 3.801
6.575ArgSer: 6.575 ± 2.735
2.391ArgThr: 2.391 ± 0.742
5.977ArgVal: 5.977 ± 2.84
1.195ArgTrp: 1.195 ± 0.616
1.195ArgTyr: 1.195 ± 1.359
0.0ArgXaa: 0.0 ± 0.0
Ser
8.368SerAla: 8.368 ± 1.983
0.598SerCys: 0.598 ± 0.673
4.782SerAsp: 4.782 ± 1.677
1.793SerGlu: 1.793 ± 0.898
5.977SerPhe: 5.977 ± 1.316
5.977SerGly: 5.977 ± 0.905
1.793SerHis: 1.793 ± 0.746
1.793SerIle: 1.793 ± 1.013
4.184SerLys: 4.184 ± 1.462
11.955SerLeu: 11.955 ± 1.804
1.195SerMet: 1.195 ± 0.616
1.793SerAsn: 1.793 ± 0.595
5.977SerPro: 5.977 ± 1.793
1.793SerGln: 1.793 ± 1.013
5.977SerArg: 5.977 ± 2.885
8.368SerSer: 8.368 ± 1.999
4.782SerThr: 4.782 ± 1.566
3.586SerVal: 3.586 ± 0.689
2.989SerTrp: 2.989 ± 1.475
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.184ThrAla: 4.184 ± 1.561
1.195ThrCys: 1.195 ± 1.163
5.977ThrAsp: 5.977 ± 3.745
1.195ThrGlu: 1.195 ± 0.481
2.989ThrPhe: 2.989 ± 1.635
7.77ThrGly: 7.77 ± 1.938
1.195ThrHis: 1.195 ± 0.675
1.195ThrIle: 1.195 ± 0.675
4.184ThrLys: 4.184 ± 1.265
2.989ThrLeu: 2.989 ± 1.028
1.195ThrMet: 1.195 ± 0.502
2.989ThrAsn: 2.989 ± 1.663
5.977ThrPro: 5.977 ± 1.747
0.598ThrGln: 0.598 ± 0.338
4.184ThrArg: 4.184 ± 2.537
3.586ThrSer: 3.586 ± 1.189
4.782ThrThr: 4.782 ± 1.358
1.793ThrVal: 1.793 ± 0.898
2.391ThrTrp: 2.391 ± 2.043
1.195ThrTyr: 1.195 ± 0.481
0.0ThrXaa: 0.0 ± 0.0
Val
1.793ValAla: 1.793 ± 0.706
1.195ValCys: 1.195 ± 0.481
4.782ValAsp: 4.782 ± 1.744
1.195ValGlu: 1.195 ± 0.616
3.586ValPhe: 3.586 ± 0.994
1.793ValGly: 1.793 ± 1.013
3.586ValHis: 3.586 ± 1.209
0.598ValIle: 0.598 ± 0.581
0.598ValLys: 0.598 ± 0.581
4.782ValLeu: 4.782 ± 1.609
1.793ValMet: 1.793 ± 1.196
1.195ValAsn: 1.195 ± 0.481
4.184ValPro: 4.184 ± 1.866
2.989ValGln: 2.989 ± 1.008
2.391ValArg: 2.391 ± 1.351
6.575ValSer: 6.575 ± 0.337
2.989ValThr: 2.989 ± 0.566
5.977ValVal: 5.977 ± 2.264
0.0ValTrp: 0.0 ± 0.0
1.793ValTyr: 1.793 ± 1.013
0.0ValXaa: 0.0 ± 0.0
Trp
2.989TrpAla: 2.989 ± 0.985
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.793TrpGlu: 1.793 ± 0.595
1.195TrpPhe: 1.195 ± 1.021
1.195TrpGly: 1.195 ± 1.163
1.195TrpHis: 1.195 ± 1.359
1.793TrpIle: 1.793 ± 1.345
1.195TrpLys: 1.195 ± 0.616
4.782TrpLeu: 4.782 ± 1.698
0.598TrpMet: 0.598 ± 0.581
0.0TrpAsn: 0.0 ± 0.0
0.598TrpPro: 0.598 ± 0.338
1.195TrpGln: 1.195 ± 0.481
0.0TrpArg: 0.0 ± 0.0
2.391TrpSer: 2.391 ± 0.958
1.793TrpThr: 1.793 ± 1.013
0.598TrpVal: 0.598 ± 0.338
1.793TrpTrp: 1.793 ± 1.744
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.195TyrAla: 1.195 ± 0.675
0.0TyrCys: 0.0 ± 0.0
1.195TyrAsp: 1.195 ± 0.675
0.0TyrGlu: 0.0 ± 0.0
0.598TyrPhe: 0.598 ± 0.581
1.793TyrGly: 1.793 ± 0.898
1.793TyrHis: 1.793 ± 0.81
2.989TyrIle: 2.989 ± 0.898
3.586TyrLys: 3.586 ± 1.088
1.793TyrLeu: 1.793 ± 0.81
0.598TyrMet: 0.598 ± 0.338
0.598TyrAsn: 0.598 ± 0.338
2.391TyrPro: 2.391 ± 0.839
1.793TyrGln: 1.793 ± 0.731
1.793TyrArg: 1.793 ± 0.898
0.598TyrSer: 0.598 ± 0.338
0.598TyrThr: 0.598 ± 0.338
1.195TyrVal: 1.195 ± 0.675
1.195TyrTrp: 1.195 ± 0.675
0.598TyrTyr: 0.598 ± 0.581
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1674 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski