Skip to content

skoren/PilonGrid

Repository files navigation

IMPORTANT: THIS PIPELINE IS DEPRECATED

We recommend using the VGP pipeline based on FreeBayes instead of Pilon for polishing.

PilonGrid

The distribution is a parallel wrapper around the Pilon framework The pipeline is composed of bash scripts, an example mapping.fofn which shows how to input your fastq files (you give paths to the R1 file), and how to launch the pipeline.

The current pipeline has been designed to run on the SGE scheduling system and has hard-coded grid resource request parameters. You must edit pilon.sh to match your grid options. It is, in principle, possible to run on other grid engines but will require editing all shell scripts to not use SGE_TASK_ID but the appropriate variable for your grid environment and editing the qsub commands in pilon.sh to the appropriate commands for your grid environment.

To run the pipeline you need to:

  1. You must have a working java version (1.8+ recommended). Run javac *.java in the working folder

  2. You must have a working installation of Pilon and symlink the jar file from wherever you have placed PilonGrid. You must also have a working bwa mem and samtools in your path.

  3. Create a mapping.fofn input file specifying which fastq files you want to use as input. They can be gzipped/bzipped. The pipeline assumes that paired-end reads are used with one file named *_R1_* and the other name *_R2_*. It will fail if this is not the case.

  4. Run the pipeline specifying the input fasta assembly.

sh pilon.sh asm.fasta

The pipeline is very rough and has undergone limited testing so user beware.

CITE

If you find this pipeline useful, please cite the original Pilon paper:
Walker BJ et. al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One, 2014.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published