Vesicular stomatitis Indiana virus (strain San Juan) (VSIV)
Average proteome isoelectric point is 7.28
Get precalculated fractions of proteins
Acidic
pI < 6.8
6.8-7.4
pI > 7.4
Basic
All
Note: above files contain also dissociation constants (pKa)
Virtual 2D-PAGE plot for 9 proteins (isoelectric point calculated using IPC2_protein)
Get csv file with sequences according to given criteria:
* You can choose from 21 different methods for calculating isoelectric point
Summary statistics related to proteome-wise predictions
Protein with the lowest isoelectric point:
>sp|P03521|NCAP_VSIVA Nucleoprotein OS=Vesicular stomatitis Indiana virus (strain San Juan) OX=11285 GN=N PE=1 SV=1
MM1 pKa = 8.26 DD2 pKa = 5.12 NLTKK6 pKa = 10.36 VRR8 pKa = 11.84 EE9 pKa = 4.07 YY10 pKa = 10.74 LKK12 pKa = 10.75 SYY14 pKa = 10.66 SRR16 pKa = 11.84 LDD18 pKa = 3.47 QAVGEE23 pKa = 4.08 IDD25 pKa = 3.96 EE26 pKa = 5.27 IEE28 pKa = 3.97 AQRR31 pKa = 11.84 AEE33 pKa = 3.95 KK34 pKa = 10.72 SNYY37 pKa = 9.35 EE38 pKa = 4.03 LFQEE42 pKa = 4.94 DD43 pKa = 3.8 GVEE46 pKa = 4.08 EE47 pKa = 4.23 HH48 pKa = 6.6 TKK50 pKa = 10.39 PSYY53 pKa = 10.31 FQAADD58 pKa = 3.66 DD59 pKa = 4.51 SDD61 pKa = 4.18 TEE63 pKa = 4.31 SEE65 pKa = 5.06 PEE67 pKa = 3.81 IEE69 pKa = 4.88 DD70 pKa = 3.57 NQGLYY75 pKa = 10.33 AQDD78 pKa = 3.9 PEE80 pKa = 4.32 AEE82 pKa = 4.15 QVEE85 pKa = 4.84 GFIQGPLDD93 pKa = 4.24 DD94 pKa = 4.68 YY95 pKa = 11.81 ADD97 pKa = 3.85 EE98 pKa = 4.35 EE99 pKa = 4.7 VDD101 pKa = 4.84 VVFTSDD107 pKa = 3.02 WKK109 pKa = 10.5 PPEE112 pKa = 4.5 LEE114 pKa = 3.95 SDD116 pKa = 3.58 EE117 pKa = 4.48 HH118 pKa = 6.73 GKK120 pKa = 7.3 TLRR123 pKa = 11.84 LTSPEE128 pKa = 4.05 GLSGEE133 pKa = 4.78 QKK135 pKa = 10.45 SQWLSTIKK143 pKa = 10.75 AVVQSAKK150 pKa = 9.7 YY151 pKa = 8.96 WNLAEE156 pKa = 4.46 CTFEE160 pKa = 4.44 ASGEE164 pKa = 4.14 GVIMKK169 pKa = 9.27 EE170 pKa = 3.92 RR171 pKa = 11.84 QITPDD176 pKa = 3.32 VYY178 pKa = 11.16 KK179 pKa = 9.69 VTPVMNTHH187 pKa = 7.32 PSQSEE192 pKa = 3.94 AVSDD196 pKa = 3.56 VWSLSKK202 pKa = 9.99 TSMTFQPKK210 pKa = 9.52 KK211 pKa = 10.78 ASLQPLTISLDD222 pKa = 3.55 EE223 pKa = 4.62 LFSSRR228 pKa = 11.84 GEE230 pKa = 4.17 FISVGGDD237 pKa = 2.49 GRR239 pKa = 11.84 MSHH242 pKa = 6.7 KK243 pKa = 10.31 EE244 pKa = 3.81 AILLGLRR251 pKa = 11.84 YY252 pKa = 9.68 KK253 pKa = 10.6 KK254 pKa = 10.43 LYY256 pKa = 8.53 NQARR260 pKa = 11.84 VKK262 pKa = 11.01 YY263 pKa = 9.84 SLL265 pKa = 3.74
Molecular weight: 29.91 kDa
Isoelectric point according different methods:
IPC2.protein.svr19 4.553
IPC2_protein 4.495
IPC_protein 4.431
Toseland 4.279
ProMoST 4.533
Dawson 4.368
Bjellqvist 4.52
Wikipedia 4.228
Rodwell 4.266
Grimsley 4.19
Solomon 4.368
Lehninger 4.317
Nozaki 4.482
DTASelect 4.609
Thurlkill 4.279
EMBOSS 4.24
Sillero 4.546
Patrickios 3.859
IPC_peptide 4.38
IPC2_peptide 4.533
IPC2.peptide.svr19 4.486
Protein with the highest isoelectric point:
>sp|P0C2X2-2|C-2_VSIVA Isoform of P0C2X2 Isoform C of Protein C' OS=Vesicular stomatitis Indiana virus (strain San Juan) OX=11285 GN=P PE=4 SV=1
MM1 pKa = 7.22 SCSKK5 pKa = 10.58 RR6 pKa = 11.84 MEE8 pKa = 4.21 WKK10 pKa = 10.55 SILSPLIFRR19 pKa = 11.84 QQMILTQNLNQKK31 pKa = 10.15 LKK33 pKa = 8.68 TIKK36 pKa = 10.41 VCMHH40 pKa = 7.56 RR41 pKa = 11.84 IQKK44 pKa = 10.01 LSKK47 pKa = 10.56 LKK49 pKa = 10.62 ALYY52 pKa = 10.18 RR53 pKa = 11.84 GLL55 pKa = 3.84
Molecular weight: 6.58 kDa
Isoelectric point according different methods:
IPC2.protein.svr19 9.277
IPC2_protein 9.94
IPC_protein 10.394
Toseland 11.111
ProMoST 10.687
Dawson 11.169
Bjellqvist 10.804
Wikipedia 11.316
Rodwell 11.652
Grimsley 11.184
Solomon 11.272
Lehninger 11.257
Nozaki 11.096
DTASelect 10.804
Thurlkill 11.082
EMBOSS 11.506
Sillero 11.096
Patrickios 11.403
IPC_peptide 11.286
IPC2_peptide 9.765
IPC2.peptide.svr19 8.509
Peptides (in silico digests for buttom-up proteomics)
Below you can find
in silico digests of the whole proteome with Trypsin, Chymotrypsin, Trypsin+LysC, LysN, ArgC proteases suitable for different mass spec machines.
Try ESI
ChTry ESI
ArgC ESI
LysN ESI
TryLysC ESI
Try MALDI
ChTry MALDI
ArgC MALDI
LysN MALDI
TryLysC MALDI
Try LTQ
ChTry LTQ
ArgC LTQ
LysN LTQ
TryLysC LTQ
Try MSlow
ChTry MSlow
ArgC MSlow
LysN MSlow
TryLysC MSlow
Try MShigh
ChTry MShigh
ArgC MShigh
LysN MShigh
TryLysC MShigh
General Statistics
Number of major isoforms
Number of additional isoforms
Number of all proteins
Number of amino acids
Min. Seq. Length
Max. Seq. Length
Avg. Seq. Length
Avg. Mol. Weight
6
3
9
4034
55
2109
448.2
51.08
Amino acid frequency
Ala
Cys
Asp
Glu
Phe
Gly
His
Ile
Lys
Leu
5.181 ± 0.635
1.537 ± 0.305
5.85 ± 0.394
5.478 ± 0.625
4.636 ± 0.386
6.148 ± 0.341
2.9 ± 0.328
6.52 ± 0.64
6.743 ± 0.606
9.395 ± 0.612
Met
Asn
Gln
Pro
Arg
Ser
Thr
Val
Trp
Tyr
3.198 ± 0.486
3.966 ± 0.419
5.082 ± 0.255
3.396 ± 0.424
5.057 ± 0.482
8.379 ± 0.325
5.677 ± 0.214
5.032 ± 0.277
1.958 ± 0.188
3.867 ± 0.38
Note: For amino acid frequency statistics the error has been estimated with the bootstraping (x100) at the protein level
Most of the basic statistics you can see at this page can be downloaded from this CSV file
For dipeptide frequency statistics click here