Skip to content

Pipelines for NGS data preprocessing by the Bock lab and friends

Notifications You must be signed in to change notification settings

himrichova/open_pipelines

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pipelines

These are some of the NGS pipelines developed and used in the Bock lab at CeMM.

This repository contains pipelines for processing NGS data and associated scripts used by them (in the /pipelines/tools subdirectory). Pipelines here are configured to work with looper and use pypiper (see the corresponding repositories). These pipelines work with metadata formatted as a portable encapsulated project.

Installation and usage

  1. Install looper and pypiper:
  • pip install https://github.com/pepkit/looper/zipball/v0.9.1
  • pip install https://github.com/databio/pypiper/zipball/v0.6
  1. Clone this repository:
  • git clone git@github.com:epigen/open_pipelines.git
  1. Produce a configuration file for your project (see here how to do it).

  2. Link the pipelines to your project by pointing to the pipeline interfaces configuration:

        metadata:
            pipeline_interfaces: open_pipelines/pipeline_interface.yaml
  3. Run all jobs using looper:

  • looper run project/metadata/project_config.yaml

If you are just using a pipeline in a project, and you are not developing the pipeline, you should treat this cloned repo as read-only, frozen code, which should reside in a shared project workspace. There should be only one clone for the project, to avoid running data under changing pipeline versions (you should not pull any pipeline updates unless you plan to re-run the whole thing).

Test data for pipelines

Small example data for several pipeline types is available in the microtest repository

Pipeline documentation

See the dedicated pages detailing the steps and output of each of the pipelines:

The remaining pipelines will get their dedicated documentation soon.

Contributing

We appreciate and encourage contributions to existing pipelines or submitions of new ones.

Simply clone the repository, make your changes and create a pull request with the changes in the dev branch.

About

Pipelines for NGS data preprocessing by the Bock lab and friends

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages

  • Python 96.3%
  • R 3.7%