Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Automatic filter of ENCODE blacklist #85

Open
marziacremona opened this issue Jun 28, 2019 · 0 comments
Open

Automatic filter of ENCODE blacklist #85

marziacremona opened this issue Jun 28, 2019 · 0 comments

Comments

@marziacremona
Copy link
Contributor

I think it would be great to add an automatic filtering of known artifacts, such as the ones present in ENCODE blacklists.

Functional genomics experiments based on next-gen sequencing (e.g. ChIP-seq, MNase-seq, DNase-seq, FAIRE-seq) that measure biochemical activity of various elements in the genome often produce artifact signal in certain regions of the genome. It is important to keep track of and filter artifact regions that tend to show artificially high signal (excessive unstructured anomalous reads mapping).

MSPC could have an additional argument (say, --blacklist ) so that the user can decide not to filter anything (--blacklist none), or to filter out every peak overlapping the ENCODE blacklist corresponding to the genome of reference (--blacklist hg19, --blacklist GRCh38 and so on. The complete list of available blacklists can be found here).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants