Quark

Semi-reference-based short-read compression

Assumptions

The read files must be gzip-compressed, e.g. 1.fastq.gz and 2.fastq.gz.
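The gzip requirement can be checked before encoding. The following is a minimal sketch, not part of Quark: `is_gzipped` is a hypothetical helper name, and the file names are the placeholders used above. It relies only on `gzip -t`, which tests archive integrity.

```shell
# Hypothetical helper (not part of Quark): succeeds if the file exists
# and passes gzip's integrity test.
is_gzipped() {
    [ -f "$1" ] && gzip -t < "$1" 2>/dev/null
}

# Check both mates before encoding; the file names are placeholders.
for f in 1.fastq.gz 2.fastq.gz; do
    if is_gzipped "$f"; then
        echo "$f: ok"
    else
        echo "$f: missing or not gzip-compressed" >&2
    fi
done
```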

The software has been tested on paired-end and single-end data in a bash-compatible shell (redirection might not work in alternative shells such as fish). Single-end support will be added to the quark.sh script soon.

Dependency

Quark depends on plzip for downstream compression. More information about plzip, including an installation guide, is available on the plzip home page.
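A quick way to confirm that the tools are installed is to look them up on the PATH. This is a sketch under assumptions: `have_tool` is a hypothetical helper, plzip is the only dependency this README states, and the other names are simply the programs invoked by the commands in this README.

```shell
# Hypothetical helper (not part of Quark): succeeds if the program is on PATH.
have_tool() {
    command -v "$1" >/dev/null 2>&1
}

# plzip is the stated dependency; git, cmake, make and snakemake are the
# programs that the commands in this README invoke.
missing=0
for tool in plzip git cmake make snakemake; do
    if ! have_tool "$tool"; then
        echo "missing: $tool" >&2
        missing=1
    fi
done

if [ "$missing" -eq 0 ]; then
    echo "all tools found"
fi
```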

Compile

$ git clone https://github.com/COMBINE-lab/quark.git
$ cd quark
$ mkdir build
$ cd build
$ cmake ..
$ make
$ cd ..

Running Quark

To see the options

$ ./quark.sh -h

To build the index with k-mer size k

snakemake -s quark.snake make_index --config out="<output dir>" fasta="<fasta file>" kmer=<#k>

To Encode

Single end

snakemake -s quark.snake encode --config out="<output dir>" index="<index dir>" r="<mate>" p=<#threads> lib="single" quality=0

Paired end

snakemake -s quark.snake encode --config out="<output dir>" index="<index dir>" m1="<mate1>" m2="<mate2>" p=<#threads> lib="paired" quality=0

To Decode

snakemake -s quark.snake decode --config in="<in dir>" out="<out dir>" lib="paired/single" quality=0

To check that the decoded sequences are identical to the originals (the compression is lossless)

$ ./check_pair.sh <original left end> <original right end> <quark left end> <quark right end>
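For illustration, a sequence-level comparison of one original file against one decoded file might look like the sketch below. It is not the contents of check_pair.sh. It assumes uncompressed FASTQ input with four lines per record, and it sorts the sequences before comparing because this README does not say whether read order is preserved. The names `sorted_seqs` and `same_seqs` are hypothetical.

```shell
# Print the sequence line (2nd of every 4 lines) of a FASTQ file, sorted.
# Assumes four-line FASTQ records.
sorted_seqs() {
    awk 'NR % 4 == 2' "$1" | LC_ALL=C sort
}

# Succeeds if both files contain the same multiset of sequences.
same_seqs() {
    ss_a=$(mktemp) && ss_b=$(mktemp) || return 2
    sorted_seqs "$1" > "$ss_a"
    sorted_seqs "$2" > "$ss_b"
    cmp -s "$ss_a" "$ss_b"
    ss_status=$?
    rm -f "$ss_a" "$ss_b"
    return "$ss_status"
}
```

Gzipped originals would need to be decompressed first (for example with `gzip -dc`) before being passed to `same_seqs`. Sorting ignores read order and does not check that mates stay paired, so this is a weaker check than a pair-aware comparison.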

Link to the preprint

"Quark enables semi-reference-based compression of RNA-seq data" by Hirak Sarkar and Rob Patro
