Skip to content

Code for removing PCR duplicates from SAM files which can be the result of PCR amplification bias.

Notifications You must be signed in to change notification settings

Zach-Sisson-1/Deduper-Zach-Sisson-1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deduper

The utility of this script will be to address the issue of PCR duplicates in RNA-seq workflows. PCR duplicates can be the result of a bias in PCR amplification and if unfiltered, can lead to downstream expression bias. The script is designed for use after sequencing alignment, and will input a SAM file of uniqely mapped, single-end reads, along with a list of Unique Molcular Indexes (UMIs) of length 8, and ouput the SAM file, retaining only a single copy of each set of PCR duplicates.

Final script titled Deduper.py.

About

Code for removing PCR duplicates from SAM files which can be the result of PCR amplification bias.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages