Marine Group I thaumarchaeote SCGC AAA799-O18

Taxonomy: cellular organisms; Archaea; TACK group; Thaumarchaeota; Thaumarchaeota incertae sedis; Marine Group I

Average proteome isoelectric point is 6.95

Precalculated fractions of proteins are available for download: acidic (pI < 6.8), pI 6.8-7.4, basic (pI > 7.4), and all.



Virtual 2D-PAGE plot for 697 proteins (isoelectric point calculated using IPC2_protein)

Sequences matching given criteria (prediction method and molecular weight range in kDa) can be downloaded as a CSV file. You can choose from 21 different methods for calculating the isoelectric point.

Summary statistics related to proteome-wise predictions


    

Protein with the lowest isoelectric point:
>tr|A0A087RMS4|A0A087RMS4_9ARCH Copper-containing nitrite reductase OS=Marine Group I thaumarchaeote SCGC AAA799-O18 OX=1502294 GN=AAA799O18_00513 PE=4 SV=1
MM1 pKa = 7.67LFFLVLSPAFVYY13 pKa = 10.75AEE15 pKa = 4.02PQTITVEE22 pKa = 4.02TDD24 pKa = 3.04YY25 pKa = 11.61LSYY28 pKa = 11.18EE29 pKa = 4.18KK30 pKa = 10.98GSVILVSGSITNFDD44 pKa = 3.07TSDD47 pKa = 3.38PVKK50 pKa = 10.53VYY52 pKa = 10.37EE53 pKa = 4.07VALRR57 pKa = 11.84IIDD60 pKa = 3.9PNNNIITISQILPSDD75 pKa = 3.64VGSFLYY81 pKa = 10.53NVNTAGSLWKK91 pKa = 10.18FAGDD95 pKa = 3.69YY96 pKa = 10.4NISVNYY102 pKa = 9.95GDD104 pKa = 4.21TSATTTFSFIIPEE117 pKa = 4.0VEE119 pKa = 3.53AAEE122 pKa = 4.06AAEE125 pKa = 4.0AAEE128 pKa = 4.12AAEE131 pKa = 4.12AAEE134 pKa = 4.12AAEE137 pKa = 4.12AAEE140 pKa = 4.09AAEE143 pKa = 4.27AAEE146 pKa = 4.54EE147 pKa = 4.29KK148 pKa = 10.59CGEE151 pKa = 4.21GTHH154 pKa = 7.26LEE156 pKa = 4.51DD157 pKa = 4.31GACVLDD163 pKa = 4.71EE164 pKa = 4.22IVSVEE169 pKa = 4.11STPAPSTSSAEE180 pKa = 4.43TVSSWIYY187 pKa = 10.4SITFAVLIAFVIAIFLYY204 pKa = 10.48LISRR208 pKa = 11.84ASRR211 pKa = 11.84KK212 pKa = 8.4KK213 pKa = 6.7TTT215 pKa = 3.38

Molecular weight: 22.94 kDa

Protein with the highest isoelectric point:
>tr|A0A087RLE7|A0A087RLE7_9ARCH Uncharacterized protein OS=Marine Group I thaumarchaeote SCGC AAA799-O18 OX=1502294 GN=AAA799O18_00649 PE=4 SV=1
MM1 pKa = 7.19ATSKK5 pKa = 10.86AKK7 pKa = 10.44RR8 pKa = 11.84SAAAKK13 pKa = 9.55KK14 pKa = 9.82AARR17 pKa = 11.84TRR19 pKa = 11.84KK20 pKa = 9.66RR21 pKa = 11.84NAAKK25 pKa = 10.3RR26 pKa = 11.84KK27 pKa = 9.03AAAKK31 pKa = 9.77KK32 pKa = 9.77AAAKK36 pKa = 9.87RR37 pKa = 11.84KK38 pKa = 9.32RR39 pKa = 11.84RR40 pKa = 11.84AAAKK44 pKa = 10.01KK45 pKa = 9.27GGRR48 pKa = 11.84RR49 pKa = 11.84KK50 pKa = 9.36AAKK53 pKa = 10.06RR54 pKa = 11.84GGKK57 pKa = 9.66KK58 pKa = 10.03KK59 pKa = 10.52GGKK62 pKa = 9.37KK63 pKa = 10.01KK64 pKa = 10.54GGKK67 pKa = 8.45KK68 pKa = 9.58RR69 pKa = 11.84KK70 pKa = 7.73AAKK73 pKa = 9.75RR74 pKa = 11.84RR75 pKa = 11.84RR76 pKa = 3.63

Molecular weight: 8.21 kDa
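
The two sequences above carry inline per-residue annotations: each annotated residue appears once as a plain letter and is then repeated with its 1-based position and a pKa value (for example "EE15 pKa = 4.02"). The following is a minimal sketch for recovering the plain sequence and the pKa list from such a string. The function name `parse_annotated` is hypothetical, and the reading of the doubled letter as "residue plus annotation label" is an assumption based on the text shown here, not a documented Proteome-pI format.

```python
import re

# One annotated residue: the letter, the same letter again, its 1-based
# position, and the pKa value, e.g. "EE15 pKa = 4.02" (assumed layout).
ANNOTATED = re.compile(r"([A-Z])\1(\d+) pKa = (\d+(?:\.\d+)?)")


def parse_annotated(text):
    """Return (plain_sequence, [(position, residue, pKa), ...])."""
    pkas = []

    def keep_residue(match):
        residue, position, pka = match.groups()
        pkas.append((int(position), residue, float(pka)))
        return residue  # keep a single copy of the letter

    sequence = ANNOTATED.sub(keep_residue, text.strip())
    return sequence, pkas


# First 17 residues of the lowest-pI protein, copied from the entry above.
seq, pkas = parse_annotated(
    "MM1 pKa = 7.67LFFLVLSPAFVYY13 pKa = 10.75AEE15 pKa = 4.02PQ"
)
print(seq)   # MLFFLVLSPAFVYAEPQ
print(pkas)  # [(1, 'M', 7.67), (13, 'Y', 10.75), (15, 'E', 4.02)]
```

A quick consistency check is that every reported position points at the same letter in the recovered sequence, i.e. `seq[position - 1] == residue` for each entry.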

Peptides (in silico digests for bottom-up proteomics)

Below you can find in silico digests of the whole proteome with the proteases Trypsin, Chymotrypsin, Trypsin+LysC, LysN, and ArgC, prepared for different types of mass spectrometers.

Digests are provided for each combination of protease (Try, ChTry, ArgC, LysN, TryLysC) and instrument setting (ESI, MALDI, LTQ, MSlow, MShigh).
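
As an illustration of what an in silico tryptic digest does, here is a minimal sketch. It assumes the commonly used simplified rule (cleave after K or R unless the next residue is P) with no missed cleavages. The exact cleavage rules and filters used to build the Proteome-pI digests are not stated on this page, and the function name `trypsin_digest` is hypothetical.

```python
def trypsin_digest(sequence):
    """Split a protein sequence after K or R, except when followed by P.

    Simplified trypsin rule, no missed cleavages (an assumption, not
    necessarily the rule set used by the database).
    """
    peptides = []
    start = 0
    for i, residue in enumerate(sequence):
        next_residue = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        # Trailing peptide that does not end in K or R.
        peptides.append(sequence[start:])
    return peptides


# First 13 residues of the highest-pI protein listed above.
print(trypsin_digest("MATSKAKRSAAAK"))  # ['MATSK', 'AK', 'R', 'SAAAK']
```

For a lysine- and arginine-rich protein such as the highest-pI entry, this rule yields many very short peptides, which is one reason alternative proteases can be useful.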

General Statistics

Number of major isoforms: 697
Number of additional isoforms: 0
Number of all proteins: 697
Number of amino acids: 158835
Min. Seq. Length: 22
Max. Seq. Length: 1263
Avg. Seq. Length: 227.9
Avg. Mol. Weight: 25.57 kDa
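
The average sequence length follows directly from the totals above. The short check below recomputes it and also derives a rough mean mass per residue. The variable names are illustrative, and the per-residue figure is only an approximation derived from the rounded values on this page.

```python
total_residues = 158835   # Number of amino acids
n_proteins = 697          # Number of all proteins
avg_mass_kda = 25.57      # Avg. Mol. Weight (kDa)

avg_length = total_residues / n_proteins
print(round(avg_length, 1))  # 227.9

# Approximate mean mass per residue in Da, from the two averages.
mean_residue_mass_da = avg_mass_kda * 1000 / avg_length
print(round(mean_residue_mass_da, 1))
```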

Amino acid frequency (%)

Ala: 5.967 ± 0.095
Cys: 1.119 ± 0.036
Asp: 5.627 ± 0.069
Glu: 6.636 ± 0.085
Phe: 4.398 ± 0.074
Gly: 6.567 ± 0.098
His: 1.85 ± 0.041
Ile: 8.912 ± 0.079
Lys: 9.213 ± 0.129
Leu: 8.665 ± 0.085
Met: 2.606 ± 0.045
Asn: 5.094 ± 0.071
Gln: 3.611 ± 0.054
Pro: 2.957 ± 0.052
Arg: 3.868 ± 0.071
Ser: 6.92 ± 0.082
Thr: 5.492 ± 0.072
Val: 6.501 ± 0.086
Trp: 0.849 ± 0.031
Tyr: 3.108 ± 0.067

Note: For amino acid frequency statistics, the error was estimated by bootstrapping (100 replicates) at the protein level.
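
A protein-level bootstrap of this kind could look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. This is an assumption about the procedure. The page does not say which spread measure is used (standard deviation is assumed here), and the function names are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def aa_frequencies(proteins):
    """Percent frequency of each residue over all given sequences."""
    counts = Counter("".join(proteins))
    total = sum(counts.values())
    return {aa: 100.0 * n / total for aa, n in counts.items()}


def bootstrap_error(proteins, residue, replicates=100, seed=0):
    """Mean and standard deviation of a residue's frequency across
    bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, same sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(aa_frequencies(sample).get(residue, 0.0))
    return mean(estimates), stdev(estimates)


# Toy input; a real run would use all 697 proteome sequences.
toy_proteome = ["MATSKAKR", "MLFFLVLSPAFVY", "GGKKKGGK", "AEAAEAAE"]
freq, err = bootstrap_error(toy_proteome, "K")
print(f"Lys: {freq:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps the composition of each protein intact, so the error reflects variation between proteins.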

Most of the basic statistics shown on this page can be downloaded as a CSV file. Dipeptide frequency statistics are also available.

See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski