Edwards Lab License: MIT

hecatomb

A hecatomb is a great sacrifice; a great number of people, animals or things, especially as sacrificed or destroyed. In this case we provide a hecatomb of false-positive viral sequences. Hecatomb is desi

What is hecatomb?

Hecatomb is a pipeline for culling spurious sequences from viral metagenomes.

Is hecatomb for me?

If you have:

  • viral metagenomes
  • sequences that might be viral
  • RoundA/B (DNA and cDNA from RNA viral genomes) sequences

Then hecatomb is for you!

Installing hecatomb

Please see the detailed installation instructions. It's really quite simple: you need conda and snakemake, and we'll do the rest.

Running hecatomb

You will need a directory containing gzip-compressed fastq files. At the moment, hecatomb requires paired-end reads. The only requirement is that the file names contain _R1 for the first mate and _R2 for the second mate. We identify the input files by looking for _R1 in the file names.
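Before starting a run, it can help to check that every _R1 file has an _R2 mate. The sketch below is not part of hecatomb: the helper name pair_reads and the .fastq.gz extension are assumptions made for illustration, and the pipeline's own file discovery may work differently.

```python
from pathlib import Path


def pair_reads(fastq_dir):
    """Map each _R1 file to its expected _R2 mate; raise if a mate is missing.

    Hypothetical helper, not hecatomb code. Assumes files end in .fastq.gz.
    """
    pairs = {}
    for r1 in sorted(Path(fastq_dir).glob("*_R1*.fastq.gz")):
        # Swap only the last "_R1" in the name, so a sample ID that
        # happens to contain "_R1" earlier in the name is left alone.
        head, _, tail = r1.name.rpartition("_R1")
        r2 = r1.with_name(f"{head}_R2{tail}")
        if not r2.exists():
            raise FileNotFoundError(f"no _R2 mate found for {r1.name}")
        pairs[r1] = r2
    return pairs
```

Calling pair_reads("test_data/fastq") from the repository root should return one entry per sample if the example data follows the naming convention described above.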

The test_data/fastq directory has some example datasets that you can run through the pipeline to see if it is working.

You should be able to cd into the GitHub directory and run snakemake directly:

conda install -c bioconda -c conda-forge snakemake
git clone https://github.com/shandley/hecatomb.git
cd hecatomb
snakemake --configfile snakemake/config/sample_config.yaml -s snakemake/workflow/download_databases.smk --cores 4 --use-conda
snakemake --configfile snakemake/config/sample_config.yaml -s snakemake/workflow/Snakefile --cores 4 --use-conda

Our installation documentation includes help on setting up snakemake profiles, so that you can run that same command as:

cd hecatomb
snakemake --configfile configs/sample_config.json --snakefile snakemake/contaminant_removal.snakefile

Config File

We recommend that you make a copy of the config file in your working directory and edit it there. That way you can keep track of any changes you've made and rerun the code as necessary.

cd my_working_directory
cp ~/hecatomb/configs/sample_config.json ./config.json
nano config.json
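To see which settings you have changed relative to the sample config, a small comparison script can help. This is a minimal sketch, assuming both files are JSON objects at the top level; the function name changed_keys is hypothetical and not part of hecatomb.

```python
import json

_MISSING = object()  # marks a key that is absent from one of the files


def changed_keys(original_path, edited_path):
    """Return the top-level keys whose values differ between two JSON configs.

    Hypothetical helper, not hecatomb code. Assumes each file holds a JSON object.
    """
    with open(original_path) as fh:
        original = json.load(fh)
    with open(edited_path) as fh:
        edited = json.load(fh)
    keys = set(original) | set(edited)
    return sorted(
        k for k in keys
        if original.get(k, _MISSING) != edited.get(k, _MISSING)
    )
```

For example, changed_keys("sample_config.json", "config.json") lists every top-level setting that was added, removed, or edited in your copy; pass whatever paths match your own layout.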
