Skip to content

download small testing datasets useful for assessing tool functionality, e.g. jaccard similarity over species at increasing evol. distances

bluegenes/2018-test_datasets

Repository files navigation

Download Test Datasets from NCBI

Intended for downloading small testing datasets useful for assessing tool functionality, e.g. jaccard similarity over species at increasing evol. distances.

Clone the Repo:

git clone https://github.com/bluegenes/2018-test_datasets
cd 2018-test_datasets

Install the required software. To do this in a new conda environment called dl-test-datasets-env, run the following:

conda env create -f environment.yml -n dl-test-datasets-env
# then enter into that environment:
conda activate dl-test-datasets-env

usage:

python download_genbank_datasets.py denticola.csv -o test --subfolders --genbank

If you'd like to download RNA or Protein files as well, add the --rna or --protein flags.

About

download small testing datasets useful for assessing tool functionality, e.g. jaccard similarity over species at increasing evol. distances

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •