Skip to content

hurwitzlab/Radcot

Repository files navigation

Radcot

Determines bacterial genomes from DNA reads then shows gene expression differences from RNA reads

This is a monolithic program that combines the three steps (using the programs centrifuge, bowtie2, samtools, htseq-count, deseq2, and numerous python and R libraries)

The idea is that this one program combines all three steps so you do not have to run them separately.

Table of Contents

Created by gh-md-toc

These are the individual steps if you would like to install / run them separately. CAUTION: I do not guarantee that these are up-to-date with the full Radcot

  • Identify bacterial species from a metagenomic sample with DNA alignment
  • Download genomes and annotations of said species
  • Align RNA reads to abundanct bacterial genomes
  • Get counts of RNA alignments to transcript-producing genes (CDS)
  • Run Deseq2 guided by the experiment setup file (metadata.txt) that will output graphs / tables of differentially expressed genes

Installation / Quick start:

  1. Use https://www.imicrobe.us/#/apps to access the app with your cyverse login OR
  2. git clone https://github.com/hurwitzlab/radcot on a linux system where you have admin rights
  3. Install singularity
  4. Build the singularity image in singularity/ by executing make img
  5. Upload all the files to an HPC with a slurm scheduler1
  6. Run the program with sbatch run.sh [arguments]

1This can be adapted to run on other batch-scheduled high-performance computer systems but this has not been tested.


Example run:

Getting the example data

  1. Have iRODS tools installed on your HPC
  2. Get the example data with iget -r /iplant/home/shared/imicrobe/example_data/radcot-data * If using iMicrobe, this will be your input directory and you shouldn't need to download
  3. To check you have the data, make sure each DNA and RNA read file matches those found in metadata_template.txt

Running it

  1. Move the radcot.img into /stampede
  2. Issue make test
  3. Detail of expected output can be found in the google doc at the bottom of the page but briefly: * You should see centrifugue reports, genomes, bam files, count files and finally deseq graphs and tables in the output

References:

Thank you to @kyclark and @bhurwitz33 for helpful comments and suggestions.

About

Determines bacterial genomes from DNA reads then shows gene expression differences from RNA reads

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published