.. role:: emph .. role:: raw-html(raw) :format: html .. include:: links.rst .. _taxonomic-scope-doc: =========================== Determining taxonomic scope =========================== The following steps can be used to determine the taxonomic scope for a protein of interest. We recommend doing this for non-microbial proteins. For microbial proteins, the species tree is poorly defined; therefore, topiary automatically sets the scope to "all bacteria" or "all archaea". #. BLAST known protein sequences against the `non-redundant clustered `_ BLAST database, setting the :code:`Max target sequences` parameter to 1,000 or more. .. image:: _static/img/find-taxonomic-scope/step-1.png :align: center :alt: BLAST sequence against non-redundant clustered :height: 300 :raw-html:`
` #. Take a few representative sequences from the most divergent species in the outputs. These can be selected using the "Taxonomy" tab. .. image:: _static/img/find-taxonomic-scope/step-2.png :align: center :alt: BLAST sequence against non-redundant clustered :height: 300 :raw-html:`
` #. BLAST the divergent sequences back against the NCBI `non-redundant `_ database, limiting the search to the species from which you took your known sequences. (If, for example, you used a human sequence as your starting point, you would limit this "reciprocal" query to *Homo sapiens*.) .. image:: _static/img/find-taxonomic-scope/step-3.png :align: center :alt: BLAST sequence against non-redundant clustered :height: 300 :raw-html:`
` #. If this BLAST search pulls up your starting protein as a top hit, it is good evidence that the species from which the sequence came has the protein and should be included in the taxonomic scope. .. image:: _static/img/find-taxonomic-scope/step-4.png :align: center :alt: BLAST sequence against non-redundant clustered :height: 300 :raw-html:`
`