Contains 50 replicates, each with 200 gene trees from the S1001
dataset (seqlen = 400). The species tree is also provided in
s_tree.trees
to enable full reproduction of the results.
The file names are in the form of g200.p{x}.tre
, where
iqtree2 -nt 1 -m GTR+G -abayes -t $tree -s $alignment_with_taxa_removed
The file names are in the form of g200.clade.p{x}.tre
,
where
Scripts should be run from their respective directories as the working directory.
All scripts depend on TreeSwift. As such the scripts are licensed as GPL-3.0. Additionally, they depend on BioPython and.
s100_missing_data/count_leaves.py # count the number of leaves in each clade-based gene tree
random_clade.py # generate clade-based missing data for gene trees
random_missing.py # generate fixed-number-of-taxa-missing gene trees
These data are provided for the purpose of reproducing the results. The ownership of the data belongs to the authors of the original paper.
Footnotes
-
Zhang, C., Rabiee, M., Sayyari, E., Mirarab, S.: ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinformatics 19(6), 153 (2018). ↩