dedupl fastq
masikol edited this page Oct 18, 2022 · 3 revisions
This script deduplicates fastq files.
It looks through all fastq files in the working directory and writes the deduplicated reads to the directory fastq_deduplicated,
nested in the working directory. As a result, all reads in the output directory are unique.
The script treats reads as duplicates if their IDs are equal; the sequences themselves are not compared.
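To illustrate the ID-based criterion, here is a minimal sketch of how such deduplication could work. It is not the code of dedupl-fastq.py itself. The function name dedup_fastq_by_id is hypothetical, and the sketch assumes that every record takes exactly four lines and that the read ID is the first whitespace-delimited token after the leading "@"; the actual script may define the ID differently.

```python
def dedup_fastq_by_id(infile, outfile):
    """Copy fastq records from infile to outfile, keeping only the first
    record for each read ID. Returns (records_read, records_written).

    Assumptions (not taken from dedupl-fastq.py):
      - every record is exactly 4 lines (no multi-line sequences);
      - the read ID is the first token of the header line after '@'.
    """
    seen_ids = set()
    n_read = 0
    n_written = 0
    while True:
        header = infile.readline()
        if not header:
            break  # end of file
        if not header.strip():
            continue  # tolerate blank lines between records
        # Sequence line, '+' separator line, quality line
        rest = [infile.readline() for _ in range(3)]
        n_read += 1
        fields = header[1:].split()
        read_id = fields[0] if fields else ""
        if read_id in seen_ids:
            continue  # duplicate ID: skip the whole record
        seen_ids.add(read_id)
        outfile.write(header)
        outfile.writelines(rest)
        n_written += 1
    return n_read, n_written
```

Both arguments are open text file objects, so the function can be called once per fastq file found in the working directory. Keeping the seen IDs in a set makes each lookup constant time on average, at the cost of holding all IDs of a file in memory.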
The script is written in Python, so you need a Python interpreter (version 3.X) to use it. Python can be downloaded from the official site, python.org.
The script takes no arguments; just run it to process all fastq files in the working directory:
./dedupl-fastq.py