fast_downsample

A single Python script for quickly downsampling FASTA/FASTQ files to a specific data size. It also works with gzip-compressed files (*.gz). The code requires Python 3.5 or later. No other dependencies are needed.

Linux run example

python downsample.py --expected 10000000 --ifsort false --input /path-to-file/input.fasta --output /path-to-outdir/output.fasta
# --expected : target number of bases in the output
# --ifsort : whether the input file is in order (true/false)
# --input : input .fasta/.fastq file, optionally gzip-compressed (.gz)
# --output : output file; currently only .fastq/.fasta output is supported
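
To make the idea of a base budget concrete, here is a minimal sketch of downsampling a FASTA stream to roughly `--expected` bases. It is not taken from downsample.py: the helper names `open_text`, `read_fasta`, and `take_until` are hypothetical, and the strategy (keep records in file order until the budget is reached) is an assumption about one simple way to do it.

```python
import gzip
import io


def open_text(path):
    """Open a plain or gzip-compressed file in text mode (hypothetical helper)."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "r")


def read_fasta(handle):
    """Yield (header, sequence) tuples from an open FASTA text handle."""
    header, chunks = None, []
    for line in handle:
        line = line.rstrip("\n")
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line, []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def take_until(records, expected):
    """Yield records in order until at least `expected` bases have been emitted."""
    total = 0
    for header, seq in records:
        if total >= expected:
            break
        total += len(seq)
        yield header, seq


# Small in-memory demo; a real file would be opened with open_text(path).
demo = io.StringIO(">r1\nACGT\n>r2\nGG\nTT\n>r3\nAAAA\n")
kept = list(take_until(read_fasta(demo), expected=6))
for header, seq in kept:
    print(header, len(seq))
```

Because whole records are kept, the output can overshoot the budget by up to one record length. Taking records in file order is only unbiased if the order carries no meaning, which may be what an option like `--ifsort` is meant to signal, though that is a guess.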

Slurm submission script example

Pysbatch: https://github.com/luptior/pysbatch

# change the input file path and other variables in the shell script if necessary
sh slurm_example.sh
python slurm_example.py

further

Add support for upsampling.
A simpler method for normalizing a batch of sequence files to the same size/depth.

