Candidatus Entotheonella factor
Average proteome isoelectric point is 6.06
Get precalculated fractions of proteins: acidic (pI < 6.8), pI 6.8-7.4, basic (pI > 7.4), all
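The three pI ranges above can be applied to any list of predicted isoelectric points. The following is a minimal sketch, not the database's own code: the function names, the label "neutral" for the 6.8-7.4 range, and the choice to treat both boundaries as part of that middle range are assumptions made for illustration.

```python
from collections import Counter


def pi_class(pi, low=6.8, high=7.4):
    """Assign a pI value to one of three ranges (boundaries are an assumption)."""
    if pi < low:
        return "acidic"
    if pi > high:
        return "basic"
    return "neutral"  # hypothetical label for the 6.8-7.4 range


def pi_fractions(pi_values):
    """Return the fraction of proteins falling in each pI range."""
    if not pi_values:
        raise ValueError("pi_values must not be empty")
    counts = Counter(pi_class(p) for p in pi_values)
    total = len(pi_values)
    return {label: counts[label] / total for label in ("acidic", "neutral", "basic")}


# Toy input: four pI values, two of them taken from this page.
print(pi_fractions([3.757, 6.06, 7.0, 11.008]))
```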
Note: the above files also contain dissociation constants (pKa)
Virtual 2D-PAGE plot for 8429 proteins (isoelectric point calculated using IPC2_protein)
Get a CSV file with sequences matching the given criteria:
* You can choose from 21 different methods for calculating isoelectric point
Summary statistics for proteome-wide predictions
Protein with the lowest isoelectric point:
>tr|W4LAU2|W4LAU2_9BACT Dihydrolipoamide acetyltransferase component of pyruvate dehydrogenase complex OS=Candidatus Entotheonella factor OX=1429438 GN=ETSY1_32265 PE=3 SV=1
MM1 pKa = 7.52 LLSLNTIFTWCLLVAWLVKK20 pKa = 10.41 PFGFTADD27 pKa = 3.58 AQALLPSDD35 pKa = 4.12 VQTVDD40 pKa = 2.93 ATGDD44 pKa = 3.5 TMVNLTLPHH53 pKa = 6.83 GFTLSGAVRR62 pKa = 11.84 TEE64 pKa = 3.78 RR65 pKa = 11.84 GDD67 pKa = 3.65 PVALGTLTARR77 pKa = 11.84 STGGQFTSSLQGGDD91 pKa = 3.46 SAYY94 pKa = 10.62 HH95 pKa = 5.98 IALPGGTYY103 pKa = 10.67 DD104 pKa = 4.65 LNLSIPFLEE113 pKa = 4.43 SDD115 pKa = 3.49 EE116 pKa = 4.27 TGIPIFVYY124 pKa = 9.31 MASDD128 pKa = 3.77 VVAGLGIAADD138 pKa = 3.95 TTQDD142 pKa = 3.79 LVVPDD147 pKa = 4.86 LPDD150 pKa = 3.43 LVTVTGFVDD159 pKa = 3.74 PQDD162 pKa = 3.92 ADD164 pKa = 3.92 TVPTEE169 pKa = 4.07 GALQLVSTDD178 pKa = 3.04 GGTFTLALFEE188 pKa = 5.09 DD189 pKa = 6.15 FYY191 pKa = 10.01 TTRR194 pKa = 11.84 LPLNTYY200 pKa = 9.35 NVSAALTFTEE210 pKa = 5.42 AIGTNGPRR218 pKa = 11.84 TTAKK222 pKa = 10.66 VIVNVDD228 pKa = 3.01 AVTVNDD234 pKa = 4.09 EE235 pKa = 3.89 PTIYY239 pKa = 10.58 DD240 pKa = 3.14 ITLPATANLSGTVLDD255 pKa = 4.33 SAAAPLAPARR265 pKa = 11.84 VVATAVDD272 pKa = 3.38 ADD274 pKa = 3.86 AAVPSDD280 pKa = 3.19 IDD282 pKa = 3.77 FSCRR286 pKa = 11.84 PEE288 pKa = 4.02 VTFSLVPMPVSGTAFLYY305 pKa = 10.49 RR306 pKa = 11.84 EE307 pKa = 4.47 PGEE310 pKa = 4.27 SSTVGAYY317 pKa = 10.04 HH318 pKa = 6.42 MPLPIGTYY326 pKa = 9.69 HH327 pKa = 7.29 LNVTLGLEE335 pKa = 4.4 LQPEE339 pKa = 4.43 MVPTLVDD346 pKa = 3.64 PEE348 pKa = 4.52 VSSSSIVHH356 pKa = 6.24 IAMPDD361 pKa = 3.27 VALTTDD367 pKa = 3.81 TAQNLAVPALPPVVMVSGLVTDD389 pKa = 5.15 ALGQPVANAAVVAMTEE405 pKa = 3.94 ALAGVANAVRR415 pKa = 11.84 FLAEE419 pKa = 3.9 VQTQEE424 pKa = 4.23 DD425 pKa = 3.98 GSYY428 pKa = 10.1 EE429 pKa = 4.13 LPVLSGTSYY438 pKa = 11.05 TIMACPPRR446 pKa = 11.84 LQQ448 pKa = 3.45
Molecular weight: 46.62 kDa
Isoelectric point according to different methods:
IPC2.protein.svr19 3.685
IPC2_protein 3.757
IPC_protein 3.783
Toseland 3.554
ProMoST 3.948
Dawson 3.77
Bjellqvist 3.923
Wikipedia 3.719
Rodwell 3.605
Grimsley 3.465
Solomon 3.77
Lehninger 3.719
Nozaki 3.884
DTASelect 4.139
Thurlkill 3.605
EMBOSS 3.719
Sillero 3.897
Patrickios 1.227
IPC_peptide 3.757
IPC2_peptide 3.872
IPC2.peptide.svr19 3.806
Protein with the highest isoelectric point:
>tr|W4LII1|W4LII1_9BACT Uncharacterized protein OS=Candidatus Entotheonella factor OX=1429438 GN=ETSY1_21555 PE=4 SV=1
MM1 pKa = 7.89 PKK3 pKa = 9.71 MKK5 pKa = 9.95 TNRR8 pKa = 11.84 SAAKK12 pKa = 10.05 RR13 pKa = 11.84 FRR15 pKa = 11.84 LTARR19 pKa = 11.84 GKK21 pKa = 9.33 VRR23 pKa = 11.84 RR24 pKa = 11.84 NQAFTRR30 pKa = 11.84 HH31 pKa = 5.85 ILTKK35 pKa = 9.37 KK36 pKa = 6.62 TRR38 pKa = 11.84 KK39 pKa = 9.46 RR40 pKa = 11.84 KK41 pKa = 8.59 RR42 pKa = 11.84 QLRR45 pKa = 11.84 STTLVAAADD54 pKa = 3.59 APRR57 pKa = 11.84 IKK59 pKa = 10.35 RR60 pKa = 11.84 IISSS64 pKa = 3.44
Molecular weight: 7.47 kDa
Isoelectric point according to different methods:
IPC2.protein.svr19 9.461
IPC2_protein 11.008
IPC_protein 12.588
Toseland 12.749
ProMoST 13.247
Dawson 12.749
Bjellqvist 12.749
Wikipedia 13.232
Rodwell 12.501
Grimsley 12.793
Solomon 13.247
Lehninger 13.144
Nozaki 12.749
DTASelect 12.749
Thurlkill 12.749
EMBOSS 13.247
Sillero 12.749
Patrickios 12.223
IPC_peptide 13.247
IPC2_peptide 12.237
IPC2.peptide.svr19 9.082
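Most of the classical methods listed above differ mainly in the pKa set they plug into the same calculation: find the pH at which the Henderson-Hasselbalch net charge of the sequence is zero. The sketch below illustrates that idea with bisection. The pKa values are the set commonly attributed to EMBOSS and are an assumption here, as are all function and constant names; the machine-learning methods (the svr19 variants) work differently and are not covered. The test sequence is the highest-pI protein above with the pKa annotations removed, so under these assumptions the result should fall close to the EMBOSS value in the list.

```python
# pKa set commonly attributed to EMBOSS (assumption, not taken from this page).
N_TERM_PKA = 8.6
C_TERM_PKA = 3.6
POSITIVE_PKA = {"K": 10.8, "R": 12.5, "H": 6.5}
NEGATIVE_PKA = {"D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1}


def net_charge(sequence, ph):
    """Henderson-Hasselbalch net charge of a sequence at the given pH."""
    charge = 1.0 / (1.0 + 10.0 ** (ph - N_TERM_PKA))    # free N-terminus
    charge -= 1.0 / (1.0 + 10.0 ** (C_TERM_PKA - ph))   # free C-terminus
    for residue in sequence:
        if residue in POSITIVE_PKA:
            charge += 1.0 / (1.0 + 10.0 ** (ph - POSITIVE_PKA[residue]))
        elif residue in NEGATIVE_PKA:
            charge -= 1.0 / (1.0 + 10.0 ** (NEGATIVE_PKA[residue] - ph))
    return charge


def isoelectric_point(sequence, tolerance=1e-4):
    """Bisect pH in [0, 14] until the net charge crosses zero."""
    low, high = 0.0, 14.0
    while high - low > tolerance:
        mid = (low + high) / 2.0
        if net_charge(sequence, mid) > 0:
            low = mid   # still positively charged: pI lies at higher pH
        else:
            high = mid
    return (low + high) / 2.0


# W4LII1 sequence from this page, annotations stripped (64 residues).
BASIC = "MPKMKTNRSAAKRFRLTARGKVRRNQAFTRHILTKKTRKRKRQLRSTTLVAAADAPRIKRIISS"
print(round(isoelectric_point(BASIC), 2))
```

Bisection works because the net charge decreases monotonically with pH, so there is a single zero crossing to find.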
Peptides (in silico digests for bottom-up proteomics)
Below you can find in silico digests of the whole proteome with Trypsin, Chymotrypsin, Trypsin+LysC, LysN, and ArgC proteases, suitable for different mass spectrometry instruments.
Try ESI
ChTry ESI
ArgC ESI
LysN ESI
TryLysC ESI
Try MALDI
ChTry MALDI
ArgC MALDI
LysN MALDI
TryLysC MALDI
Try LTQ
ChTry LTQ
ArgC LTQ
LysN LTQ
TryLysC LTQ
Try MSlow
ChTry MSlow
ArgC MSlow
LysN MSlow
TryLysC MSlow
Try MShigh
ChTry MShigh
ArgC MShigh
LysN MShigh
TryLysC MShigh
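As an illustration of what an in silico digest does, the sketch below applies the commonly used trypsin rule: cleave after K or R unless the next residue is P. This is an assumption about the rule, not a description of how the files above were generated; missed cleavages and the instrument-specific filtering implied by the ESI, MALDI, LTQ, MSlow, and MShigh labels are not modeled, and the function name is hypothetical.

```python
def trypsin_digest(sequence):
    """Split a protein into peptides, cleaving after K/R except before P."""
    peptides = []
    start = 0
    for i, residue in enumerate(sequence):
        next_residue = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])  # C-terminal peptide without K/R
    return peptides


# First 12 residues of the highest-pI protein on this page.
print(trypsin_digest("MPKMKTNRSAAK"))
```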
General Statistics
Number of major isoforms: 8429
Number of additional isoforms: 0
Number of all proteins: 8429
Number of amino acids: 2493613
Min. Seq. Length: 29
Max. Seq. Length: 4723
Avg. Seq. Length: 295.8
Avg. Mol. Weight: 32.74 kDa
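These figures are simple aggregates over the sequence lengths, and the average length follows directly from the other two counts (2493613 / 8429 is about 295.8). A minimal sketch of how such a summary could be computed from a list of sequences is shown below; the function name and dictionary keys are hypothetical, and the three toy sequences are for illustration only.

```python
def summarize_lengths(sequences):
    """Basic length statistics for a list of protein sequences."""
    lengths = [len(seq) for seq in sequences]
    return {
        "proteins": len(lengths),
        "amino_acids": sum(lengths),
        "min_length": min(lengths),
        "max_length": max(lengths),
        "avg_length": round(sum(lengths) / len(lengths), 1),
    }


toy = ["MKT", "MPKMKTNR", "ACDEFGHIKL"]
print(summarize_lengths(toy))

# Consistency check on the numbers reported on this page.
print(round(2493613 / 8429, 1))
```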
Amino acid frequency (%)
Ala: 10.08 ± 0.028
Cys: 1.07 ± 0.01
Asp: 5.707 ± 0.021
Glu: 6.085 ± 0.028
Phe: 3.62 ± 0.016
Gly: 7.591 ± 0.03
His: 2.74 ± 0.014
Ile: 5.275 ± 0.021
Lys: 2.854 ± 0.021
Leu: 10.487 ± 0.04
Met: 2.573 ± 0.015
Asn: 2.769 ± 0.018
Gln: 5.327 ± 0.02
Pro: 4.668 ± 0.026
Arg: 6.685 ± 0.025
Ser: 5.367 ± 0.017
Thr: 5.535 ± 0.024
Val: 7.183 ± 0.023
Trp: 1.505 ± 0.013
Tyr: 2.878 ± 0.016
Note: For amino acid frequency statistics, the error has been estimated by bootstrapping (x100) at the protein level
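Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the frequencies each time. The sketch below shows one way to do this with 100 rounds. It is an assumption that the reported error corresponds to the standard deviation across rounds; the function names, the fixed seed, and the toy sequences are also illustrative only.

```python
import random
from collections import Counter
from statistics import mean, stdev

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_frequencies(proteins):
    """Percentage of each standard amino acid over all given sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq)
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    return {aa: 100.0 * counts[aa] / total for aa in AMINO_ACIDS}


def bootstrap_frequencies(proteins, rounds=100, seed=0):
    """Resample proteins with replacement; return (mean, stdev) per amino acid."""
    rng = random.Random(seed)
    samples = {aa: [] for aa in AMINO_ACIDS}
    for _ in range(rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        freqs = aa_frequencies(resampled)
        for aa in AMINO_ACIDS:
            samples[aa].append(freqs[aa])
    return {aa: (mean(values), stdev(values)) for aa, values in samples.items()}


toy_proteome = ["MKTAYIAKQR", "ACDEFGHIKLMNPQRSTVWY", "GGGSSS"]
for aa, (avg, err) in bootstrap_frequencies(toy_proteome).items():
    print(f"{aa}: {avg:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.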
Most of the basic statistics shown on this page can be downloaded from this CSV file
For dipeptide frequency statistics click here