Keeping low number of duplicates #76

phoebe460 · 2024-04-11T18:18:41Z

Hi EPIC2 Developers,

First off, thank you for creating a great peak calling tool. I am planning to use this for my own analysis for ChIP-seq data actually. In that case, I am wondering if there is a way to keep duplicates using --keep-duplicates but setting it to just a low number for instance like 1 instead of just to True, which would remove the majority of PCR duplicates found but still keep a low number of duplicates?

A similar thing can be done using their keep duplicates flag in MACS3 as follows:

--keep-dup

It controls the MACS3 behavior towards duplicate tags at the exact same location – the same coordination and the same strand. You can set this as auto, all, or an integer value. The auto option makes MACS3 calculate the maximum tags at the exact same location based on binomial distribution using 1e-5 as p-value cutoff; and the all option keeps every tag. If an integer is given, at most this number of tags will be kept at the same location. The default is to keep one tag at the same location. Default: 1

If you can clarify this for me before I start using your program, then that would be greatly appreciated.

Thank you,
Phoebe

The text was updated successfully, but these errors were encountered:

endrebak · 2024-04-12T07:12:01Z

This is something I could consider. I think it makes sense. It should not be hard to allow keeping some duplicates, even though it will make the runtime a bit longer.

phoebe460 · 2024-04-12T19:11:15Z

Hi @endrebak,

Thank you for your reply back. Sure, if there is anyway you could support this kind of implementation into your tool, then that would be awesome. It will definitely help me with my own analysis for the ChIP-seq dataset I am currently working with needs to have some but not all duplicates retained.

Keep me posted,
Phoebe

endrebak · 2024-04-13T05:58:16Z

I will not have the time to do this anytime soon. You can preprocess the data yourself and use --keep-duplicates

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keeping low number of duplicates #76

Keeping low number of duplicates #76

phoebe460 commented Apr 11, 2024

endrebak commented Apr 12, 2024

phoebe460 commented Apr 12, 2024 •

edited

Loading

endrebak commented Apr 13, 2024

Keeping low number of duplicates #76

Keeping low number of duplicates #76

Comments

phoebe460 commented Apr 11, 2024

endrebak commented Apr 12, 2024

phoebe460 commented Apr 12, 2024 • edited Loading

endrebak commented Apr 13, 2024

phoebe460 commented Apr 12, 2024 •

edited

Loading