Skip to content

aemann01/flagPCR_opticalDups

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Flag PCR and optical duplicates

DOI

Script that calculates the number of pcr and optical duplicates (duplicated records that map to the same start size on the same reference with the same tile and cigar string that fall within 100 pixels of another record) in a sam file. Returns multiple output files -- nonduplicates.txt, pcr_duplicates.txt, optical_duplicates, and count_table.txt (crosstabulation of optical duplicate record groups per sample). Optionally draws a pdf figure of the location where optical duplicates fall on the tile.

Input sam file must have the sample ID as the first field separated by ":" i.e., sample1:911:HJNNYADXX:2:2215:16157:7489

python flag_duplicates.py sorted.sam

About

Calculates number of PCR and optical duplicates from a sorted sam file

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages