dated-complete-tree

Code that takes the Open Tree of Life, resolves all polytomies, and interpolates dates for all nodes. For a full explanation of the algorithms and the results of using them, see this preprint.

Resolution and interpolation can be performed repeatedly to generate distributions of dated complete trees. Dates can be sampled from available date sources in the Open Tree phylesystem.

Optionally, you can also compute a distribution of evolutionary distinctiveness scores (you can choose ED/EDGE or ED2/EDGE2) for each leaf node.

Pre-computed median trees and tree distributions can be found at the accompanying Zenodo dataset.

Prerequisites:

The Open Tree of Life. By default we look for a folder ./opentree16.1_tree/ containing the files annotations.json and labelled_supertree/labelled_supertree_ottnames.tre.
The Open Tree Taxonomy. By default we look for a folder called ./ott3.7.3/ containing taxonomy.tsv.
The Python library chronosynth, available at https://github.com/OpenTreeOfLife/chronosynth/. This should be available in your PYTHONPATH.
A folder to cache the chronosynth output. By default we use ./chronosynth_date_info/.

Usage:

There are three options:

The file main.py can be run from the command line. For available options run: python main.py --help. For example: python main.py --num_trees=10 --pd_clades=pd_clades.txt will produce 10 trees with different topologies and a text file with phylogenetic diversity (PD) distributions for the clades specified in your text file ./pd_clades.txt (one Open Tree node name per line, e.g. Eukaryota_ott304358). The default output folder for the Newick-format trees and the PD file is ./output/.
The file main_non_exec.py contains code you can edit and run from your favourite Python IDE.
A Jupyter notebook edge2_notebook.ipynb is included, which will open a Jupyter (IPython) notebook to step through the process of loading trees and computes EDGE2 scores.

Subtrees

If you want to work with only a subtree or subset of species, see the notebook getting_a_subtree.ipynb.

Reproducing the pre-computed tree distributions

Should you wish to reproduce the distributions of trees used in the paper, you can use the following commands:

equal_splits_topo.tar.gz: python main.py --num_trees=501 --pd_clades=pd_clades.txt
equal_splits_both.tar.gz: python main.py --num_trees=501 --num_date_samples=3 --pd_clades=pd_clades.txt
birth_model_topo.tar.gz: python main.py --use_birth_model --num_trees=101 --pd_clades=pd_clades.txt
birth_model_both.tar.gz: python main.py --use_birth_model --num_trees=101 --num_date_samples=3 --pd_clades=pd_clades.txt

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
chronosynth_date_info		chronosynth_date_info
config		config
examples		examples
figures		figures
LICENSE.txt		LICENSE.txt
README.md		README.md
date_ph_stats.py		date_ph_stats.py
date_stats_14.txt		date_stats_14.txt
edge2_notebook.ipynb		edge2_notebook.ipynb
get_date_citations.py		get_date_citations.py
getting_a_subtree.ipynb		getting_a_subtree.ipynb
main.py		main.py
main_non_exec.py		main_non_exec.py
main_non_exec_ed_test.py		main_non_exec_ed_test.py
main_non_exec_getsubtree.py		main_non_exec_getsubtree.py
main_pd_dists.py		main_pd_dists.py
main_sample_dates.py		main_sample_dates.py
main_sample_dates_birthtree24.py		main_sample_dates_birthtree24.py
main_sample_dates_eqstree407.py		main_sample_dates_eqstree407.py
pd_clades.txt		pd_clades.txt
pd_date_sensitivity.py		pd_date_sensitivity.py
process_ed_scores.py		process_ed_scores.py
taxonomy_utils.py		taxonomy_utils.py
tree_checks.py		tree_checks.py
tree_dating.py		tree_dating.py
tree_fixing.py		tree_fixing.py
tree_labelling.py		tree_labelling.py
tree_loading.py		tree_loading.py
tree_metrics.py		tree_metrics.py
tree_plotting.py		tree_plotting.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dated-complete-tree

Prerequisites:

Usage:

Subtrees

Reproducing the pre-computed tree distributions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

dated-complete-tree

Prerequisites:

Usage:

Subtrees

Reproducing the pre-computed tree distributions

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages