Skip to content

sym24/RuffusPipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

#A Targeted Clinical Pipeline Built in Ruffus

##Required installation: Python 2.7.8 Python Ruffus Library TransAbyss BioBloomTool Gmap ##Input files: Full path of sequencing libraries contained in a text file. If the reads are in paired-end mode, they should be ulternating.

	For example:
	/abs/path/to/file/library1_1.fq
	/abs/path/library1_2.fq
	/abs/path/library2_1.fq
	/abs/path/library2_2.fq

###Usage for fastq_split.py Added this file to split fastq output from BioBloomTool 2.0.7, which will output matched read pairs into stdout with command '-d [FIlTERNAME]'

Help display:

usage: fastq_split.py [-h] [-p PREFIX] [-g] [-f]

Write Interlaced Fastq Standard Output Into Splited Files

optional arguments:
  -h, --help            show this help message and exit
  -p PREFIX, --prefix PREFIX
                        Provide output file path and prefix Format:
                        /path/prefix_1, /path/prefix_2 Default path: /home/ymi
                        ngsun/work/project_pipeline/ruffus/matchedReads
  -g, --gzip            Specify gzip output file. If not specified, files will
                        not be gzip.
  -f, --force           Overwriter existing file

###Usage for geneExtract.py geneExtract.py will take interested gene names and find out the longest contig coordinates using GTF file. It will then extract sequence from reference genome and build bloom filter out of targeted genes.

Help display:

usage: geneExtract.py [-h] [-p PREFIX] -t TargetGeneFile -gtf GTF_file -ref
                      Ref Genome [-f]

Extract target genes and generate bloom filter

optional arguments:
  -h, --help            show this help message and exit
  -p PREFIX, --prefix PREFIX
                        Provide output file path. Default path:
                        /home/ymingsun/work/project_pipeline/ruffus
  -t TargetGeneFile, --target TargetGeneFile
                        Provide absolute path of text file containing target
                        gene names
  -gtf GTF_file         Provide absolute path of gtf_file
  -ref Ref Genome       Provide absolute path of the reference file
  -f, --force           Overwriter existing file

About

Clinical Pipeline

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages