HUMID: reference free FastQ deduplication

https://readthedocs.org/projects/HUMID/badge/?version=latest

HUMID is a tool to quickly and easily remove duplicate reads from FastQ files, with or without UMIs.

Installation

You can install HUMID from conda

conda install -c bioconda humid

If you want to, you can also install HUMID from source.

Usage

Both the input and output of HUMID are plain FastQ files, no alignment required! This means that you can use HUMID to remove duplicates as a pre-processing step before starting your analysis. If your project was sequenced without UMIs, or if the UMIs are present in the headers of the FastQ reads (as is done by BCL Convert), you can use the following command:

humid forward.fastq.gz reverse.fastq.gz

If the UMIs are located in a separate FastQ file use

humid forward.fastq.gz reverse.fastq.gz umi.fast.gz

For other use cases, we recommend that you use fastp to move the UMIs to the header of the forward FastQ file before deduplicating them with HUMID.

Please see the usage section of the documentation for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 153 Commits
.github		.github
docs		docs
lib		lib
src		src
tests		tests
.gitignore		.gitignore
.gitmodules		.gitmodules
.readthedocs.yaml		.readthedocs.yaml
LICENSE.md		LICENSE.md
README.rst		README.rst

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HUMID: reference free FastQ deduplication

Installation

Usage

About

Releases 5

Packages

Contributors 2

Languages

License

jfjlaros/HUMID

Folders and files

Latest commit

History

Repository files navigation

HUMID: reference free FastQ deduplication

Installation

Usage

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 5

Packages 0

Contributors 2

Languages

Packages