Shigella dysenteriae serotype 1 (strain Sd197)
Average proteome isoelectric point is 6.77
Get precalculated fractions of proteins:
Acidic (pI < 6.8)
6.8-7.4
Basic (pI > 7.4)
All
Note: the above files also contain dissociation constants (pKa)
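The fractions above split proteins by isoelectric point at 6.8 and 7.4. A minimal sketch of that binning follows. The function name `pi_fraction`, the label "neutral" for the middle range, and the choice to place the boundary values 6.8 and 7.4 in the middle bin are assumptions made for illustration, not something the page specifies.

```python
def pi_fraction(pi, acidic_below=6.8, basic_above=7.4):
    """Assign a protein to a pI fraction using the thresholds shown on the page.

    The label "neutral" and the treatment of the boundary values are assumptions.
    """
    if pi < acidic_below:
        return "acidic"
    if pi > basic_above:
        return "basic"
    return "neutral"


# IPC2_protein values reported on this page for the two extreme proteins
for name, pi in [("Q32J74", 3.656), ("Q329J2", 11.242)]:
    print(name, pi_fraction(pi))
```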
Virtual 2D-PAGE plot for 3897 proteins (isoelectric point calculated using IPC2_protein)
Get a CSV file with sequences matching given criteria:
* You can choose from 21 different methods for calculating the isoelectric point
Summary statistics related to proteome-wise predictions
Protein with the lowest isoelectric point:
>tr|Q32J74|Q32J74_SHIDS Uncharacterized protein OS=Shigella dysenteriae serotype 1 (strain Sd197) OX=300267 GN=SDY_0422 PE=4 SV=1
MM1 pKa = 7.44 GVTAAQVSAWPAGTVNIAVSGEE23 pKa = 4.02 SSAGNPISITHH34 pKa = 6.85 PVTVDD39 pKa = 3.37 LTPAAITINTIATDD53 pKa = 3.79 DD54 pKa = 4.18 VINAAEE60 pKa = 4.24 KK61 pKa = 10.47 GANLTLSGTTTNVEE75 pKa = 3.87 AGQTVTVTFGGKK87 pKa = 9.87 NYY89 pKa = 7.45 TASVAGDD96 pKa = 3.94 GSWTATVPAADD107 pKa = 4.29 LAALPEE113 pKa = 4.73 GSASAQASVSNINGNSASAVHH134 pKa = 6.47 NYY136 pKa = 10.21 SVDD139 pKa = 3.25 SSAPTIIINTVASDD153 pKa = 4.03 NIVNASEE160 pKa = 4.07 ADD162 pKa = 3.3 AGVTVSGSTTAEE174 pKa = 3.75 AGQIVTITLNSPTVQTYY191 pKa = 8.96 QATVQADD198 pKa = 4.12 GSWSINIPAADD209 pKa = 4.62 LEE211 pKa = 4.22 ALTDD215 pKa = 4.13 GSHH218 pKa = 6.42 TLTATVNDD226 pKa = 4.06 KK227 pKa = 11.14 AGNPASTTHH236 pKa = 6.32 NLAVDD241 pKa = 3.83 LTVPVLTINTIAGDD255 pKa = 4.23 DD256 pKa = 3.92 IINATEE262 pKa = 4.03 HH263 pKa = 5.67 GQALVISGSSTGGEE277 pKa = 3.94 AGDD280 pKa = 3.77 VVSVTLNSKK289 pKa = 9.05 TYY291 pKa = 7.54 TTTLDD296 pKa = 3.47 ASGNWSVGVPAADD309 pKa = 3.56 VTALGSGPQTVTATVTDD326 pKa = 3.67 AAGNSDD332 pKa = 4.11 NEE334 pKa = 4.52 THH336 pKa = 5.96 TVTVNLTAPTIGINTIATDD355 pKa = 3.7 DD356 pKa = 4.25 VINATEE362 pKa = 4.42 KK363 pKa = 11.1 GADD366 pKa = 3.7 LQISGTSNQPAGTTITVTLNGQNYY390 pKa = 7.45 TATTDD395 pKa = 3.95 ASGNWSTTVPASAVGALGEE414 pKa = 3.99 ASYY417 pKa = 9.98 TVTANVTDD425 pKa = 3.6 SAGNSNSASHH435 pKa = 6.17 NVQVNTALPGVTLNPVASDD454 pKa = 4.36 DD455 pKa = 4.26 IINAAEE461 pKa = 4.13 SGVAQTISGQVTGAAAGDD479 pKa = 4.13 TVTVTLGGKK488 pKa = 8.14 TYY490 pKa = 9.31 TATVAGWW497 pKa = 3.03
Molecular weight: 48.81 kDa
Isoelectric point according to different methods:
IPC2.protein.svr19 3.732
IPC2_protein 3.656
IPC_protein 3.681
Toseland 3.452
ProMoST 3.859
Dawson 3.694
Bjellqvist 3.846
Wikipedia 3.643
Rodwell 3.503
Grimsley 3.363
Solomon 3.681
Lehninger 3.63
Nozaki 3.795
DTASelect 4.075
Thurlkill 3.516
EMBOSS 3.643
Sillero 3.808
Patrickios 1.863
IPC_peptide 3.668
IPC2_peptide 3.783
IPC2.peptide.svr19 3.744
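The annotated sequence lines on this page interleave residues with pKa values: each ionizable residue appears to be written twice, followed by its position and `pKa = value`. The sketch below recovers the plain sequence and the per-site pKa values from such a line. The function name `parse_annotated` is hypothetical, and the parsing rule is inferred from the text shown here rather than from a documented file format.

```python
import re

# One token looks like "GVTAAQVSAWPAGTVNIAVSGEE23 pKa = 4.02":
# a run of residues whose last letter is repeated, then the position and pKa.
TOKEN = re.compile(r"([A-Z]+)(\d+) pKa = (\d+(?:\.\d+)?)")


def parse_annotated(line):
    """Return (plain_sequence, [(position, residue, pKa), ...])."""
    fragments = []
    sites = []
    for letters, position, pka in TOKEN.findall(line):
        fragments.append(letters[:-1])  # drop the repeated residue letter
        sites.append((int(position), letters[-1], float(pka)))
    return "".join(fragments), sites


# First three tokens of the lowest-pI protein shown above
example = "MM1 pKa = 7.44 GVTAAQVSAWPAGTVNIAVSGEE23 pKa = 4.02 SSAGNPISITHH34 pKa = 6.85"
sequence, sites = parse_annotated(example)
print(sequence)
print(sites)
```

A quick consistency check is that each reported position equals the length of the sequence recovered up to that token.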
Protein with the highest isoelectric point:
>tr|Q329J2|Q329J2_SHIDS Uncharacterized protein OS=Shigella dysenteriae serotype 1 (strain Sd197) OX=300267 GN=yjdB PE=4 SV=1
MM1 pKa = 7.45 KK2 pKa = 9.51 RR3 pKa = 11.84 TFQPSVLKK11 pKa = 10.6 RR12 pKa = 11.84 NRR14 pKa = 11.84 SHH16 pKa = 7.16 GFRR19 pKa = 11.84 ARR21 pKa = 11.84 MATKK25 pKa = 10.4 NGRR28 pKa = 11.84 QVLARR33 pKa = 11.84 RR34 pKa = 11.84 RR35 pKa = 11.84 AKK37 pKa = 10.22 GRR39 pKa = 11.84 ARR41 pKa = 11.84 LTVSKK46 pKa = 10.99
Molecular weight: 5.38 kDa
Isoelectric point according to different methods:
IPC2.protein.svr19 9.523
IPC2_protein 11.242
IPC_protein 12.837
Toseland 13.013
ProMoST 13.495
Dawson 13.013
Bjellqvist 12.998
Wikipedia 13.481
Rodwell 12.676
Grimsley 13.042
Solomon 13.495
Lehninger 13.408
Nozaki 13.013
DTASelect 12.998
Thurlkill 13.013
EMBOSS 13.51
Sillero 13.013
Patrickios 12.398
IPC_peptide 13.51
IPC2_peptide 12.486
IPC2.peptide.svr19 9.177
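To illustrate how a pKa-based estimate of this kind can be obtained, here is a minimal sketch that assumes the classical approach: sum Henderson-Hasselbalch charges over the ionizable groups and search for the pH where the net charge is zero. The pKa values are the ones commonly attributed to EMBOSS and are an assumption here, as are the function names; this is not the code behind the numbers listed above, and the machine-learning-based IPC2 variants work differently.

```python
# pKa set commonly attributed to EMBOSS (an assumption for this sketch)
PKA_POSITIVE = {"Nterm": 8.6, "K": 10.8, "R": 12.5, "H": 6.5}
PKA_NEGATIVE = {"Cterm": 3.6, "D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1}


def net_charge(sequence, ph):
    """Net charge of a protein at a given pH (Henderson-Hasselbalch)."""
    charge = 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE["Nterm"]))
    charge -= 1.0 / (1.0 + 10 ** (PKA_NEGATIVE["Cterm"] - ph))
    for residue in sequence:
        if residue in PKA_POSITIVE:
            charge += 1.0 / (1.0 + 10 ** (ph - PKA_POSITIVE[residue]))
        elif residue in PKA_NEGATIVE:
            charge -= 1.0 / (1.0 + 10 ** (PKA_NEGATIVE[residue] - ph))
    return charge


def isoelectric_point(sequence, tolerance=1e-4):
    """Bisection on pH in [0, 14]; net charge decreases as pH rises."""
    low, high = 0.0, 14.0
    while high - low > tolerance:
        mid = (low + high) / 2.0
        if net_charge(sequence, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


# Sequence of the highest-pI protein (Q329J2), read off the annotated line above
sequence = "MKRTFQPSVLKRNRSHGFRARMATKNGRQVLARRRAKGRARLTVSK"
print(round(isoelectric_point(sequence), 2))
```

With 11 arginines, 5 lysines and no acidic side chains, the estimate lands in the strongly basic range, in line with most of the pKa-based values listed above.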
Peptides (in silico digests for bottom-up proteomics)
Below you can find in silico digests of the whole proteome with Trypsin, Chymotrypsin, Trypsin+LysC, LysN, and ArgC proteases, suitable for different mass spectrometry instruments.
ESI: Try, ChTry, ArgC, LysN, TryLysC
MALDI: Try, ChTry, ArgC, LysN, TryLysC
LTQ: Try, ChTry, ArgC, LysN, TryLysC
MSlow: Try, ChTry, ArgC, LysN, TryLysC
MShigh: Try, ChTry, ArgC, LysN, TryLysC
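As an illustration of what an in silico digest does, the sketch below applies the commonly used trypsin rule: cleave after K or R unless the next residue is P. It assumes no missed cleavages and applies no instrument-specific mass or length filter, so it is not a reproduction of the downloadable files. The function name `trypsin_digest` is hypothetical.

```python
def trypsin_digest(sequence):
    """Split a protein after K/R, except when the next residue is P.

    Simplified rule: no missed cleavages and no mass or length filtering.
    """
    peptides = []
    start = 0
    for i, residue in enumerate(sequence):
        is_last = i == len(sequence) - 1
        if residue in "KR" and not is_last and sequence[i + 1] != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])  # C-terminal peptide
    return peptides


# Highest-pI protein from this page (Q329J2)
sequence = "MKRTFQPSVLKRNRSHGFRARMATKNGRQVLARRRAKGRARLTVSK"
for peptide in trypsin_digest(sequence):
    print(peptide)
```

Because this protein is so rich in K and R, most of the resulting peptides are only one to five residues long, which is one reason digests are usually filtered before use.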
General Statistics
Number of major isoforms: 3897
Number of additional isoforms: 0
Number of all proteins: 3897
Number of amino acids: 1094416
Min. Seq. Length: 14
Max. Seq. Length: 1588
Avg. Seq. Length: 280.8
Avg. Mol. Weight: 31.19 kDa
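The averages follow from the totals, which gives a quick sanity check of the figures. The short sketch below recomputes the average sequence length from the number of amino acids and the number of proteins, and derives the implied average residue mass, assuming the average molecular weight is given in kDa.

```python
total_amino_acids = 1094416
protein_count = 3897
avg_mol_weight_kda = 31.19  # unit assumed to be kDa

# Average sequence length = total residues / number of proteins
avg_length = total_amino_acids / protein_count

# Implied average mass per residue, in daltons
avg_residue_mass_da = 1000 * avg_mol_weight_kda / avg_length

print(round(avg_length, 1))
print(round(avg_residue_mass_da, 1))
```

An average residue mass close to 110 Da is typical for proteins, which supports reading the molecular weight as kDa.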
Amino acid frequency (%)
Ala 9.419 ± 0.042
Cys 1.191 ± 0.016
Asp 5.098 ± 0.033
Glu 5.846 ± 0.04
Phe 3.796 ± 0.028
Gly 7.23 ± 0.043
His 2.321 ± 0.018
Ile 5.86 ± 0.032
Lys 4.476 ± 0.035
Leu 10.701 ± 0.049
Met 2.869 ± 0.02
Asn 3.883 ± 0.026
Gln 4.354 ± 0.023
Pro 4.433 ± 0.033
Arg 5.884 ± 0.039
Ser 5.815 ± 0.033
Thr 5.451 ± 0.035
Val 7.022 ± 0.034
Trp 1.516 ± 0.018
Tyr 2.836 ± 0.025
Note: For amino acid frequency statistics, the error has been estimated by bootstrapping (x100) at the protein level
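A minimal sketch of bootstrapping at the protein level: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread across 100 rounds serves as the error. Using the standard deviation as the error measure is an assumption, as are the function names and the toy sequences; the page does not say exactly how its error values were derived.

```python
import random
from collections import Counter
from statistics import mean, stdev


def aa_frequency(proteins, residue):
    """Percentage of `residue` among all residues of the given proteins."""
    counts = Counter("".join(proteins))
    return 100.0 * counts[residue] / sum(counts.values())


def bootstrap_frequency(proteins, residue, rounds=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev) in %."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(aa_frequency(resample, residue))
    return mean(estimates), stdev(estimates)


# Toy proteome (hypothetical sequences, for illustration only)
toy_proteome = ["MKRTFQPSVLK", "MGVTAAQVSAW", "MATKNGRQVLAR", "AAGDTVTVTLGGK"]
estimate, error = bootstrap_frequency(toy_proteome, "A")
print(f"Ala {estimate:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.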
Most of the basic statistics shown on this page can be downloaded from this CSV file
For dipeptide frequency statistics click here