Human papillomavirus 4

Taxonomy: Viruses; Monodnaviria; Shotokuvirae; Cossaviricota; Papovaviricetes; Zurhausenvirales; Papillomaviridae; Firstpapillomavirinae; Gammapapillomavirus; Gammapapillomavirus 1

Average proteome isoelectric point is 6.25

Get precalculated fractions of proteins: acidic (pI < 6.8), pI 6.8-7.4, basic (pI > 7.4), or all.
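The three pI ranges above can be applied to any predicted pI value. A minimal sketch follows; the function name `pi_class` and the handling of the exact boundary values 6.8 and 7.4 are assumptions, since the page only lists the ranges.

```python
def pi_class(pi: float) -> str:
    """Assign a pI value to one of the three ranges listed on the page.

    Assumption: the boundary values 6.8 and 7.4 fall into the middle range.
    """
    if pi < 6.8:
        return "acidic (pI < 6.8)"
    if pi <= 7.4:
        return "6.8-7.4"
    return "basic (pI > 7.4)"


# 6.25 is the average proteome pI reported above, used here only as a sample value.
print(pi_class(6.25))
```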



Virtual 2D-PAGE plot for 8 proteins (isoelectric point calculated using IPC2_protein)


* You can choose from 21 different methods for calculating isoelectric point

Summary statistics related to proteome-wise predictions


    

Protein with the lowest isoelectric point:
>sp|Q07860|VL1_HPV04 Major capsid protein L1 OS=Human papillomavirus 4 OX=10617 GN=L1 PE=3 SV=1
MRGAAPTVADLNLELNDLVLPANLLSEEVLQSSDDEYEITEEESVVPFRIDTCCYRCEVAVRITLYAAELGLRTLEQLLVEGKLTFCCTACARSLNRNGR

Per-residue pKa values (residue and position, then pKa):
M1 7.65, R2 11.84, D10 5.52, E14 4.28, D17 4.15, E27 4.86, E28 4.35, D34 3.58, D35 3.79, E36 4.64,
Y37 11.21, E38 3.89, E41 4.28, E42 4.03, E43 4.62, R49 11.84, D51 3.21, Y55 7.62, R56 11.84, E58 3.93,
R62 11.84, Y66 10.69, E69 4.67, R73 11.84, E76 3.92, E81 4.99, K83 8.34, R93 11.84, R97 11.84, R100 3.54

Molecular weight:
11.13 kDa
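As a rough cross-check of the molecular weight, the value can be recomputed from the 100-residue sequence above (per-residue pKa labels stripped). This is a sketch that assumes standard average residue masses plus one water molecule; the page does not say how its value was derived.

```python
# Average residue masses in Da (standard values; an assumption, not taken from the page).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added for the free N- and C-termini

# Sequence of the lowest-pI protein listed above.
SEQ = (
    "MRGAAPTVADLNLELNDLVLPANLLSEEVLQSSDDEYEITEEESVVPFRI"
    "DTCCYRCEVAVRITLYAAELGLRTLEQLLVEGKLTFCCTACARSLNRNGR"
)


def molecular_weight_kda(seq: str) -> float:
    """Sum the average residue masses, add one water, and convert Da to kDa."""
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


print(f"{molecular_weight_kda(SEQ):.2f} kDa")
```

The result should land close to the 11.13 kDa reported for this entry; small differences would point to a different mass table.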
Isoelectric point according to different methods:
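For orientation, an isoelectric point can be estimated from a set of pKa values by finding the pH at which the Henderson-Hasselbalch net charge is zero. The sketch below uses the per-residue pKa values listed for the protein above and is not the IPC2 method. It assumes that the M1 value is the N-terminal amine and the R100 value is the C-terminal carboxyl, and it ignores groups the page does not list (cysteines, the R100 side chain).

```python
# pKa values copied from the lowest-pI protein entry above.
# ASSUMPTION: M1 (7.65) is the N-terminal amine, R100 (3.54) the C-terminal carboxyl.
POSITIVE = [7.65] + [11.84] * 7 + [8.34]  # N-terminus, R2..R97 (7 Arg), K83
NEGATIVE = [
    5.52, 4.15, 3.58, 3.79, 3.21,                      # Asp
    4.28, 4.86, 4.35, 4.64, 3.89, 4.28,                # Glu
    4.03, 4.62, 3.93, 4.67, 3.92, 4.99,                # Glu (continued)
    11.21, 7.62, 10.69,                                # Tyr
    3.54,                                              # C-terminus
]


def net_charge(ph: float) -> float:
    """Henderson-Hasselbalch net charge at a given pH."""
    pos = sum(1.0 / (1.0 + 10 ** (ph - pka)) for pka in POSITIVE)
    neg = sum(1.0 / (1.0 + 10 ** (pka - ph)) for pka in NEGATIVE)
    return pos - neg


def isoelectric_point(tol: float = 1e-4) -> float:
    """Bisect on pH in [0, 14]; net charge decreases monotonically with pH."""
    lo, hi = 0.0, 14.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if net_charge(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


print(f"estimated pI: {isoelectric_point():.2f}")
```

Because each prediction method uses its own pKa set, the estimate from this sketch is not expected to match any particular method's value exactly.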






Protein with the highest isoelectric point:
>sp|Q07846|VE1_HPV04 Replication protein E1 OS=Human papillomavirus 4 OX=10617 GN=E1 PE=3 SV=1
MKLKILLHSSTYSSSFDTEEQQLPGPSTSYSEVTEQASPTRRRKPRKSDATSTTSPETEGVRLRRRRREGKSGPGSGETPRKRRRGGGRGGGETELGSAPSPAEVGSRHRQVERQGLSRLGLLQAEARDPPMILLKGTANSLKCWRYRKVNSNCCNFLFMSTVWNWVGDCSHNHSRMLIAFDSTDQRDAFVKHNLFPKLCTYTYGSLNSL

Per-residue pKa values (residue and position, then pKa):
M1 7.5, K2 10.53, K4 10.44, H8 6.2, Y12 10.42, D17 3.46, E19 4.32, E20 3.81, Y30 11.33, E32 4.37,
E35 4.06, R41 11.84, R42 11.84, R43 11.84, K44 9.02, R46 11.84, K47 9.24, D49 3.16, E57 4.19, E59 3.98,
R62 11.84, R64 11.84, R65 11.84, R66 11.84, R67 11.84, R68 11.84, E69 4.04, K71 9.62, E78 3.8, R81 11.84,
K82 8.86, R83 11.84, R84 11.84, R85 11.84, R89 11.84, E93 4.31, E95 4.81, E104 3.46, R108 11.84, H109 5.35,
R110 11.84, E113 4.2, R114 11.84, R119 11.84, E126 4.45, R128 11.84, D129 3.77, K136 10.21, K143 8.53, R146 11.84,
Y147 9.45, R148 11.84, K149 10.15, D169 3.76, H172 6.08, H174 4.83, R176 11.84, D182 3.96, D185 3.18, R187 11.84,
D188 3.55, K192 10.7, H193 5.86, K198 10.25, Y202 10.19, Y204 11.29, L210 3.91

Molecular weight:
23.51 kDa
Isoelectric point according to different methods:






Peptides (in silico digests for bottom-up proteomics)

Below you can find in silico digests of the whole proteome with Trypsin, Chymotrypsin, Trypsin+LysC, LysN, ArgC proteases suitable for different mass spec machines.

Available digests: each of Try, ChTry, ArgC, LysN, TryLysC combined with each of ESI, MALDI, LTQ, MSlow, MShigh.
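To illustrate what an in silico tryptic digest involves, here is a minimal sketch. It assumes the common textbook rule (cleave C-terminal to K or R, but not when the next residue is P) and no missed cleavages; the page does not state which rules or mass filters its own digests use. The sequence is that of the lowest-pI protein listed earlier.

```python
import re

# Sequence of the lowest-pI protein listed earlier (pKa labels stripped).
SEQ = (
    "MRGAAPTVADLNLELNDLVLPANLLSEEVLQSSDDEYEITEEESVVPFRI"
    "DTCCYRCEVAVRITLYAAELGLRTLEQLLVEGKLTFCCTACARSLNRNGR"
)


def trypsin_digest(seq: str) -> list:
    """Split after every K or R that is not followed by P (no missed cleavages)."""
    # Zero-width split: lookbehind for K/R, negative lookahead for P (Python 3.7+).
    return [pep for pep in re.split(r"(?<=[KR])(?!P)", seq) if pep]


for peptide in trypsin_digest(SEQ):
    print(peptide)
```

A real workflow would normally also allow missed cleavages and filter peptides by the mass range of the instrument.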

General Statistics

Number of major isoforms: 8
Number of additional isoforms: 0
Number of all proteins: 8
Number of amino acids: 2615
Min. Seq. Length: 100
Max. Seq. Length: 599
Avg. Seq. Length: 326.9
Avg. Mol. Weight: 36.86 kDa

Amino acid frequency (%)

Ala: 5.66 ± 0.26
Cys: 2.371 ± 0.676
Asp: 6.501 ± 0.551
Glu: 6.195 ± 0.669
Phe: 4.245 ± 0.604
Gly: 6.195 ± 0.689
His: 1.912 ± 0.206
Ile: 4.895 ± 0.75
Lys: 4.895 ± 0.706
Leu: 9.254 ± 0.591
Met: 1.721 ± 0.328
Asn: 4.283 ± 0.446
Gln: 5.927 ± 0.902
Pro: 4.551 ± 0.426
Arg: 6.539 ± 0.892
Ser: 8.604 ± 0.595
Thr: 6.272 ± 0.668
Val: 5.124 ± 0.532
Trp: 1.3 ± 0.252
Tyr: 3.556 ± 0.362

Note: For amino acid frequency statistics, the error has been estimated by bootstrapping (x100) at the protein level.
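A protein-level bootstrap of this kind can be sketched as follows: proteins are resampled with replacement, the frequency is recomputed on each resample, and the spread of those estimates is reported. Using the standard deviation as the error, the fixed seed, and the function names are assumptions; the page gives only "bootstrapping (x100)". The input here is just the two sequences shown on this page, so the output will not reproduce the table values, which are based on all 8 proteins.

```python
import random
from statistics import stdev

# The two sequences shown on this page (2 of the 8 proteins), used as toy input.
SEQ1 = (
    "MRGAAPTVADLNLELNDLVLPANLLSEEVLQSSDDEYEITEEESVVPFRI"
    "DTCCYRCEVAVRITLYAAELGLRTLEQLLVEGKLTFCCTACARSLNRNGR"
)
SEQ2 = (
    "MKLKILLHSSTYSSSFDTEEQQLPGPSTSYSEVTEQASPTRRRKPRKSDATSTTSPETEGVRLRRRRREG"
    "KSGPGSGETPRKRRRGGGRGGGETELGSAPSPAEVGSRHRQVERQGLSRLGLLQAEARDPPMILLKGTAN"
    "SLKCWRYRKVNSNCCNFLFMSTVWNWVGDCSHNHSRMLIAFDSTDQRDAFVKHNLFPKLCTYTYGSLNSL"
)


def aa_frequency(proteins, aa):
    """Percentage of residue `aa` over all residues in the given proteins."""
    total = sum(len(p) for p in proteins)
    return 100.0 * sum(p.count(aa) for p in proteins) / total


def bootstrap_frequency(proteins, aa, n_rounds=100, seed=0):
    """Return (frequency, bootstrap error), resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(aa_frequency(sample, aa))
    # ASSUMPTION: the error is the standard deviation of the bootstrap estimates.
    return aa_frequency(proteins, aa), stdev(estimates)


freq, err = bootstrap_frequency([SEQ1, SEQ2], "L")
print(f"Leu: {freq:.3f} ± {err:.3f}")
```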

Most of the basic statistics shown on this page can be downloaded as a CSV file.

Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski