Skip to content

Commit

Permalink
update docs & include OTUs from ghost-tree
Browse files Browse the repository at this point in the history
  • Loading branch information
JTFouquier committed Aug 11, 2015
1 parent 52b8b5e commit 08b1910
Show file tree
Hide file tree
Showing 2 changed files with 45 additions and 5 deletions.
22 changes: 17 additions & 5 deletions README.rst
Original file line number Diff line number Diff line change
Expand Up @@ -9,9 +9,18 @@ diversity analyses. One database is used as a backbone or scaffold because it
provides better phylogeny across all phyla, and the other database provides
finer taxonomic resolution.

For application to ITS, you don't need to install ghost-tree, but can use our pre-built trees. The ``trees`` directory contains pre-built reference phylogenetic trees for the `UNITE QIIME reference files available here <https://unite.ut.ee/repository.php>`_.

The most recent ghost-trees we've created are for the `sh_qiime_release_s_30.12.2014 release of UNITE, available here <https://unite.ut.ee/sh_files/sh_qiime_release_s_30.12.2014.zip>`_. Depending on which of the fasta files you're using from that directory, you'd use the corresponding ghost-tree listed below:
For application to ITS, you don't need to install ghost-tree, but can use our
pre-built trees. The tree that you download needs to match the UNITE database
that you used (or plan to use) for your ITS analyses. The ``trees`` directory
contains pre-built reference phylogenetic trees for the `UNITE QIIME reference
files available here
<https://unite.ut.ee/repository.php>`_.

The most recent ghost-trees we've created are for the
`sh_qiime_release_s_30.12.2014 release of UNITE, available here
<https://unite.ut.ee/sh_files/sh_qiime_release_s_30.12.2014.zip>`_.
Depending on which of the fasta files you're using from that directory,
you'd use the corresponding ghost-tree listed below:

* For ``sh_refs_qiime_ver6_97_30.12.2014.fasta``, use `ghosttree_UNITEv6_30.12.2014S_97_100clusters_052515.nwk <https://raw.githubusercontent.com/JTFouquier/ghost-tree/master/trees/ghost-trees_052515/ghosttree_UNITEv6_30.12.2014S_97_100clusters_052515.nwk>`_
* For ``sh_refs_qiime_ver6_97_30.12.2014.fasta``, use `ghosttree_UNITEv6_30.12.2014S_97_80clusters_052515.nwk <https://github.com/JTFouquier/ghost-tree/raw/master/trees/ghost-trees_052515/ghosttree_UNITEv6_30.12.2014S_97_80clusters_052515.nwk>`_
Expand All @@ -22,6 +31,9 @@ The most recent ghost-trees we've created are for the `sh_qiime_release_s_30.12.



If you are an experienced developer, or are interested in trying out the
ghost-tree tool via command line, then you will need to follow these
directions:

ghost-tree requires two external software tools to build a hybrid-tree or
the "ghost-tree":
Expand All @@ -36,10 +48,10 @@ To optionally (recommended) regroup the extension OTUs, ghost-tree requires
SUMACLUST:

SUMACLUST (Version 1.0.01):
http://metabarcoding.org/sumatra/wiki/download
https://git.metabarcoding.org/obitools/sumaclust/wikis/home/

Please install the software and make sure that it is in your PATH variable.
To test this you should be able to type “muscle, fasttree or sumaclust” from
To test this you need to be able to type “muscle, fasttree or sumaclust” from
command line and see the “usage” or “help” documentation for each
software tool.

Expand Down
28 changes: 28 additions & 0 deletions helper_files/get_otus_from_ghost_tree.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
"""
This file can be downloaded and used to create a .txt file containing only
the accession numbers from the ghost-tree.nwk that you plan to use for your
analyses.
You must have skbio installed. http://scikit-bio.org/
You will then use "ghost_tree_tips.txt" output file containing the accession
numbers to filter your .biom table so that it contains only the OTUs that
are in the ghost-tree.nwk that you are using.
http://qiime.org/scripts/filter_otus_from_otu_table.html
Use the required arguments and the following two optional arguments:
-e, --otu_ids_to_exclude_fp
(provide the text file containing OTU ids to exclude)
--negate_ids_to_exclude
(this will keep OTUs in otu_ids_to_exclude_fp, rather than discard them)
"""
from skbio import TreeNode

ghosttree = TreeNode.read("ghost_tree.nwk") # insert your ghost-tree name here
output = open("ghost_tree_tips.txt", "w")

for node in ghosttree.tips():
output.write(str(node.name)+"\n")

output.close()

0 comments on commit 08b1910

Please sign in to comment.