TagIdent tool (formerly GuessProt)


TagIdent is a tool which allows
  1. the generation of a list of proteins close to a given pI and Mw,
  2. the identification of proteins by matching a short sequence tag of up to 6 amino acids against proteins in the UniProt Knowledgebase (Swiss-Prot and TrEMBL) databases close to a given pI and Mw,
  3. the identification of proteins by their mass, if this mass has been determined by mass spectrometric techniques
for one or more species and with an optional keyword [references].
Documentation is available.
Fill out in the following form the pI (isoelectric point) and Mw (molecular weight) you want the search to focus on. The pI and Mw ranges allow you to specify the intervals of pI and Mw values in which the search should take place. Leave empty if you don't want to specify a range for pI or Mw.

The "Organism Name or Classfication or NCBI_TaxID" field allows you to specify terms to limit the search to a range of organisms or to a specific organism (species). A UniProtKB entry will only be returned, if all the given terms appear in either the OS or OC or OX lines of that entry. If in doubt, please consult the UniProtKB list of species. Try to be specific enough, in particular be aware that if you enter "rat", all taxonomical terms containing the substring "rat" will be searched, including "vertebRATa". Entering "rattus" or even "rattus norvegicus" (or even better its TaxID '10116') narrows down your search and probably corresponds better to what you are trying to do.

Scan in UniProtKB/Swiss-Prot can be restricted by the use of a keyword. This should be a keyword in the UniProtKB/Swiss-Prot KW lines. If in doubt, consult the UniProtKB list of keywords.

For example, entering the following values:


pI min = 5.15

pI max = 5.65

Mw = 35000

Mw range = 20%

OS/OC/OX = Viridiplantae

Keyword = chloroplast

will return all UniProtKB/Swiss-Prot entries for plant chloroplast proteins, with 5.15 <= pI <= 5.65 and 28000 <= Mw <= 42000.

When searching in UniProtKB/Swiss-Prot, TagIdent removes signal sequences and/or propeptides (as documented in the UniProtKB/Swiss-Prot feature table (FT lines)) before computing pI and Mw for each of the resulting chains.

The annotation in UniProtKB/TrEMBL is done automatically; it is incomplete and not always correct. Thus information on UniProtKB/TrEMBL FT lines is not used to process UniProtKB/TrEMBL proteins into mature chains or peptides (i.e. pI and Mw are always computed for the whole sequence), and the use of a keyword is not allowed for searches in UniProtKB/TrEMBL.


Name of your query (optional):
pI range: min max
Mw (in Daltons, not kD):
Mw range (in percent): %

Check for protein sequences with cysteines in reduced form (-SH) with cysteines oxidized (-S-S-)

Organism name or classification or NCBI_TaxID (e.g. homo sapiens, eukaryota, 9606):

Keyword from KW lines (for search in Swiss-Prot only):

Tagging
With this option, the first 40 amino acids (or the last 40 aa or the matching internal subsequence) of each protein are printed in the result, in addition to the protein name. You may optionally also enter a tag which will be matched with the sequences. If the tag is found in the displayed sequence, it will be shown in lower case, and the numerical position of the tag in the protein will be given. 'X' can be used to mean 'any amino acid'.
Tag (up to 6 amino acids):

Search for all possible tag permutations.

Display only the sequences matching the tag.

Include scan of your tag against all fragments in UniProtKB matching your organism criteria (and KW terms for Swiss-Prot only). Note: pI and Mw are not computed for fragments.

Display the predicted N-terminal sequence or the C-terminal sequence or the (internal) area of the protein that matches your tag.

Send the result by e-mail
With this option, you will receive the result by e-mail.
Your e-mail:

Database(s) on which the scan should be performed:
UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot and UniProtKB/TrEMBL
UniProtKB/TrEMBL
To run the search: or