Seqfilter

Seqfilter is a small tool written in C on top of the excellent klib library by Heng Li. It allows filtering of fasta and fastq files based on sequence IDs and sequence length. The fasta and fastq input files may be gzipped.

Get it!

git clone --recursive https://github.com/clwgg/seqfilter

cd seqfilter
make

Usage

Seqfilter will always try to filter by IDs, since thats its intended purpose. You can however omit the ID file. In that case, you have to specify the ‘-n’ flag for negative filtering if you want to produce output, to allow output of sequences without a “valid” ID.

Negative filtering (‘-n’) means that all sequences without ID matches are kept (subsequently, if no ID file is supplied, all sequences are without ID matches).

# filter by ID file
seqfilter -i in.fq -l ids.txt -o out.fq

#filter by ID file and min length 30
seqfilter -m 30 -i in.fq -l ids.txt -o out.fq

#filter only by min length 30
seqfilter -n -m 30 -i in.fq -o out.fq

#keep only sequence called "mt"
seqfilter -i in.fa -l <(printf "mt\n") -o mt.fa

#remove sequence called "mt" from input
seqfilter -n -i in.fa -l <(printf "mt\n") -o in_no-mt.fa

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
klib @ de09fb7		klib @ de09fb7
.gitmodules		.gitmodules
LICENSE		LICENSE
Makefile		Makefile
README.org		README.org
seqfilter.c		seqfilter.c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

klib @ de09fb7

klib @ de09fb7

.gitmodules

.gitmodules

LICENSE

LICENSE

Makefile

Makefile

README.org

README.org

seqfilter.c

seqfilter.c

Repository files navigation

Seqfilter

Get it!

Usage

About

Releases

Packages

Contributors 2

Languages

License

clwgg/seqfilter

Folders and files

Latest commit

History

Repository files navigation

Seqfilter

Get it!

Usage

About

Resources

License

Stars

Watchers

Forks

Languages