Skip to content
The Read Origin Protocol (ROP) is a computational protocol that aims to discover the source of all reads, including those originating from repeat sequences, recombinant B and T cell receptors, and microbial communities.
Branch: master
Clone or download
Latest commit 2e08815 Nov 28, 2018
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
example
figures
tools Update dependencies Jul 3, 2018
.editorconfig Add .editorconfig Oct 23, 2018
.gitignore Update README.md and minor fixes Mar 19, 2018
LICENSE.txt Remove extraneous files Mar 19, 2018
README.md Re-enable metaphlan Nov 28, 2018
environment.yml Update dependencies Jul 3, 2018
helper.py Fix up options Mar 21, 2018
install.sh Re-enable metaphlan Nov 28, 2018
rop.sh Re-enable metaphlan Nov 28, 2018

README.md

Read Origin Protocol

The Read Origin Protocol (ROP) is a computational protocol that aims to discover the source of all reads, including those originating from complex RNA molecules, recombinant antibodies, and microbial communities.

Written by:

at the University of California, Los Angeles (UCLA).

Released under the terms of the General Public License, version 3.0 (GPLv3). For more information, please visit: https://github.com/smangul1/rop/wiki

Installing ROP

To install ROP, first clone this repository, then run

./install.sh

from the repository's directory. This will download dependencies and databases. The default installation will generally suffice, but the following options are available:

  • -c|--clean: Just remove installed tools.
    • To reinstall, use -r|--reinstall instead.
    • Overrides conflicting options.
  • -f|--force: Unlink databases.
    • Use with caution.
  • -n|--native: Use native python.
    • MiniConda will not be downloaded.
    • You may use environment.yml to set up your python environment.
  • -l|--link LINK: Link databases instead of downloading.
    • Useful if you previously downloaded an ROP database.
    • A symlink will be created in the current directory.
    • Overrides conflicting options.
  • -d|--db-dest DB_DEST (default: .): Change database download location.
    • Useful for managing space.
    • A symlink will be created in the current directory.
  • -o|--organism ORGANISM (default: human): Organism to download databases for.
    • Exactly one of the following: human, mouse.
  • -r|--reinstall: Reinstall tools, even if they're already present.
  • -s|--select-db SELECT_DB (default: all): Database(s) to download for the specified organism.
    • A comma-separated list of one or more of the following: basic, repeats, microbiome (which may be subdivided into metaphlan, viral, fungi, protozoa).
    • -s all selects everything.
  • -h|--help: Displays usage information.

Using ROP

To use ROP, run

rop.sh unmapped_reads output_dir

Unless otherwise specified using an option, unmapped_reads must be a .fastq/.fq file, and output_dir must not exist (it will be created). Results will be written to output_dir, with one subdirectory for every stage of the pipeline. The following options are available:

  • -o|--organism (default: human): Organism to run ROP for.
    • Exactly one of the following: human, mouse.
  • -s|--steps (default: all except lowq): Select the analysis modes to use.
    • A comma-separated list of one or more of the following: lowq, rdna, reference, repeats, circrna, immune, microbiome (which may be subdivided into metaphlan, bacteria, viral, fungi, protozoa).
    • circrna and bacteria are not available in the current release.
    • -s all selects everything.
  • -a|--fasta: Input unmapped reads in .fasta format instead of .fastq format. Forcibly disables low-quality read filtering.
  • -b|--bam: Input unmapped reads in .bam format instead of .fastq format.
  • -z|--gzip: gunzip the input file.
  • -d|--dev: Keep intermediate FASTA files.
    • Consumes extra space.
  • -f|--force: Overwrite the analysis destination directory.
  • -i|--ignore-extensions: Ignore incorrect .fastq/.fq/.fasta/.fa file extensions. Does not ignore incorrect .gz/.bam file extensions.
  • -m|--max: Use a liberal threshold when remapping to reference.
    • May account for more reads.
  • -x|--commands: Print all commands (diagnostic mode).
  • -h|--help: Displays usage information.

A small example file is included in the repository in various formats. To try it out, run one of the following commands from the repository directory:

rop.sh -b example/example.bam ropout
rop.sh example/example.fastq ropout
rop.sh -z example/example.fastq.gz ropout
rop.sh -a example/example.fasta ropout
rop.sh -az example/example.fasta.gz ropout

Then, browse to the ropout directory to see the analysis results!

You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.