Rampler is a standalone module for sampling genomic sequences. It supports two modes, random subsampling of sequencing data to a desired depth (given the reference length) and file splitting to desired size in bytes.
To build rampler run the following commands:
git clone --recursive https://github.com/rvaser/rampler.git rampler
cd rampler && mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release .. && make
./bin/rampler
which will display the following usage:
usage: rampler [options ...] <mode>
<mode>
subsample <sequences> <reference length> <coverage> [<coverage> ...]
<sequences>
input file in FASTA/FASTQ format (can be compressed with gzip)
<reference length>
integer denoting length of the reference genome (or assembly)
<coverage>
integer denoting desired coverage of the subsampled sequences
split <sequences> <chunk size>
<sequences>
input file in FASTA/FASTQ format (can be compressed with gzip)
containing sequences which will be split into smaller chunks
<chunk size>
integer denoting the desired chunk size in bytes
options:
-o, --out-directory <string>
default: current directory
path in which sampled files will be created
--version
prints the version number
-h, --help
prints out the help
- gcc 4.8+ or clang 4.0+
- cmake 3.9+
- zlib
This work has been supported in part by Croatian Science Foundation under the project UIP-11-2013-7353.