Skip to content

drboothtj/getphylo_benchmarking

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

getphylo_benchmarking

A repository for data and tools used to benchmark getphylo.

Contents

This repository contains the following directories and subdirectories:

benchmarking

This directory contains information for the performance benchmarking of getphylo.

accessions

This directory contains text files containing the NBCI accessions for the Streptomyces genomes that comprise each of the six benchmarking datasets.

trees

This directory contains the trees produced during benchmarking by getphylo, autoMLST and gtdb-tk.

case_studies

This directory contains data from four case studies used to demonstrate the utility of getphylo.

case_study_1

This directory contains data relevant to case study one - a bacterial phylogeny. It includes the input genbank files and the output alignment, partition and tree.

case_study_2

This directory contains data relevant to case study two - a phylogeny of a gene clusters related to resorculin biosynthesis. It includes a list of the input MiBiG accessions and the resorculin BGC as a genbank file. It also includes the output alignment, partition and tree.

case_study_3

This directory contains data relevant to case study three - a phylogeny of primates. It contains a list of NCBI accessions used as input and the output alignment, partition and tree.

case_study_4

This directory contains data relevant to case study four - a phylogeny of Eurotiomycete fungi. It contains a list of strains used as input and the output alignment, partition and tree.

scripts

This directory contains three scripts used during benchmark:

  • alignment_information.py - prints information, such as the number of informative sites, about the supplied alignment
  • gtdbtk_unrooted.bash - runs the first three modules of gtdb-tk de novo workflow to produce an unrooted tree
  • treesum.py - prints information, such as average branch support, about the supplied newick tree

Citation

If you use getphylo or make use of the scripts or curated data/case studies in this repository, please cite:

Booth, T. J., Shaw, S., & Weber, T. (2023). getphylo: rapid and automatic generation of multi-locus phylogenetic trees. BioRxiv, 2023.07.26.550493.

DOI: https://doi.org/10.1101/2023.07.26.550493

About

A repository for data and tools used to benchmark getphylo

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published