Collection of custom Python bioinformatics utilities, primarily for manipulating and recoding VCF files.

labdave/Pipeline-Tools

 
 

Pipeline-Tools

Pipeline-Tools is a collection of command-line tools for manipulating and summarizing VCF files.

Installation

Stand-alone installation

Pipeline-Tools is currently designed only for Linux systems. You will need to install and configure the following:

  1. Python v2.7.*

    You can check your Python version by running the following command in your terminal:

    $ python -V
    Python 2.7.10

    To install the correct version of Python, visit the official Python website.

  2. Python packages: numpy, scipy, pandas, matplotlib, and pyVCF

    You will need pip to install the above packages. After installing pip, run the following commands in your terminal:

    # Upgrade pip
    sudo pip install -U pip
    
    pip install numpy
    pip install scipy
    pip install pandas
    pip install matplotlib
    pip install pyVCF

  3. Clone the Pipeline-Tools repo

    # clone the repo
    git clone https://github.com/alexwaldrop/Pipeline-Tools.git
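
After installing the packages above, a quick import check can confirm that the environment is ready. The following is a minimal sketch and not part of Pipeline-Tools: the helper name `find_missing_modules` is hypothetical, and the sketch assumes PyVCF is imported under the module name `vcf`.

```python
from __future__ import print_function

import importlib

# Import names for the packages listed above.
# Assumption: PyVCF is imported as `vcf`.
REQUIRED_MODULES = ["numpy", "scipy", "pandas", "matplotlib", "vcf"]


def find_missing_modules(module_names):
    """Return the names in module_names that cannot be imported."""
    missing = []
    for name in module_names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = find_missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules: " + ", ".join(missing))
    else:
        print("All required modules are importable.")
```

Any module reported as missing can be installed with the corresponding `pip install` command from step 2.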

Docker Installation

Pipeline-Tools is maintained as a Docker image and can run anywhere Docker is available.

The only prerequisite is the Docker client. Run the following command to check whether it is already installed on your system:

$ sudo docker --version

If Docker is not installed on your system, you can get it from Docker-client.

Once Docker is set up, pull the Pipeline-Tools image from Docker Hub by running the following command:

$ sudo docker pull alexwaldrop/pipeline-tools:latest

You can run any of the Pipeline-Tools modules as Docker containers as follows:

$ sudo docker run --rm --user root alexwaldrop/pipeline-tools:latest "RecodeVCF.py --help"
$ sudo docker run --rm --user root alexwaldrop/pipeline-tools:latest "CatRecodedVCF.py --help"
$ sudo docker run --rm --user root alexwaldrop/pipeline-tools:latest "SummarizeVCF.py --help"
$ sudo docker run --rm --user root alexwaldrop/pipeline-tools:latest "CatVCFSummary.py --help"
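
The `docker run` invocations above all follow one pattern, so they can also be assembled from a script. This is a sketch under assumptions: `build_docker_command` and `run_module` are hypothetical helpers, not part of Pipeline-Tools, and they only reuse the image name and flags shown above.

```python
from __future__ import print_function

import subprocess

IMAGE = "alexwaldrop/pipeline-tools:latest"


def build_docker_command(module_command, image=IMAGE, use_sudo=True):
    """Build the argv list for running one module inside the container.

    module_command is passed as a single argument, matching the quoted
    string used in the shell examples (e.g. "RecodeVCF.py --help").
    """
    argv = ["docker", "run", "--rm", "--user", "root", image, module_command]
    if use_sudo:
        argv.insert(0, "sudo")
    return argv


def run_module(module_command):
    """Run a module in the container; requires Docker and the pulled image."""
    subprocess.check_call(build_docker_command(module_command))


if __name__ == "__main__":
    # Print the commands only; call run_module() to execute one of them.
    scripts = ("RecodeVCF.py", "CatRecodedVCF.py",
               "SummarizeVCF.py", "CatVCFSummary.py")
    for script in scripts:
        print(" ".join(build_docker_command(script + " --help")))
```

Passing the command as a list avoids shell quoting issues when the module command contains spaces.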

Submodules

At present, Pipeline-Tools contains two tools for VCF processing:

RecodeVCF

Transforms VCF genotype calls and annotations into the searchable, sortable RecodedVCF format. Complete documentation is found here.

SummarizeVCF

Summarizes variant features based on annotations contained in a VCF. Complete documentation is found here.
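
To illustrate the general idea of summarizing variants by annotation, here is a minimal, self-contained sketch. It is not SummarizeVCF's implementation or output format: the function `count_info_values`, the `EFFECT` INFO key, and the example records are all hypothetical and made up for illustration.

```python
from __future__ import print_function

from collections import Counter


def count_info_values(vcf_lines, info_key):
    """Count how often each value of one INFO key occurs across records."""
    counts = Counter()
    for line in vcf_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and header lines
        fields = line.split("\t")
        if len(fields) < 8:
            continue  # not a complete VCF record
        # INFO is the 8th column: semicolon-separated KEY=VALUE pairs or flags.
        for entry in fields[7].split(";"):
            key, sep, value = entry.partition("=")
            if sep and key == info_key:
                counts[value] += 1
    return counts


# Made-up records for illustration only.
EXAMPLE_VCF = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "1\t100\t.\tA\tG\t50\tPASS\tDP=10;EFFECT=missense",
    "1\t200\t.\tC\tT\t60\tPASS\tDP=12;EFFECT=synonymous",
    "2\t300\t.\tG\tA\t40\tPASS\tDP=8;EFFECT=missense",
]

if __name__ == "__main__":
    effect_counts = count_info_values(EXAMPLE_VCF, "EFFECT")
    for value, n in sorted(effect_counts.items()):
        print(value, n)
```

For real files, a parser such as PyVCF handles header metadata and typed INFO fields more robustly than this line-splitting approach.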

Project Status

Pipeline-Tools is actively under development. To request features, please contact the author listed below.

Authors
