Skip to content

dedupl fastq

masikol edited this page Oct 18, 2022 · 3 revisions

dedupl-fastq

Description

This script is designed for deduplication of fastq files.

It looks through all fastq files in the working directory and writes deduplicated reads to directory fastq_deduplicated nested in the working directory. Eventually, all reads in result directory are unique.

The script considers sequences duplicated if their IDs are equal.

Dependencies

The script is written in Python, so you need Python interpreter (version 3.X) to use it. Here you can download Python.

Usage:

No arguments, just run it to process all fastq files in the working directory:

./dedupl-fastq.py
Clone this wiki locally