Allow merging of mapped datasets for deduplication? #107

Closed
FelixKrueger opened this issue May 26, 2017 · 4 comments

@FelixKrueger
Owner

Can we explore the possibility of supplying several BAM files to deduplicate_bismark so that they are treated as one big file? This might be a useful feature when the same sample is sequenced several times to increase read depth, and it would be an alternative to going back and merging at the FastQ stage.

@FelixKrueger
Owner Author

I have now added an option, --multiple, to the deduplicator that treats several input SAM/BAM files as the same sample (commit 285f46e).

It seems to work well for SAM/BAM input with either BAM or SAM output. CRAM has not been tested yet.

@xiuru

xiuru commented May 18, 2018

Hi Felix,
Can you add a new flag like 'prefix' to deduplicate_bismark? When I process multiple BAM files together, I would prefer to use a 'prefix' to name the output files.

Thank you~~

@FelixKrueger
Owner Author

Can't you just do something like:
mv multiple.bam prefix
once the deduplication is complete?
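
For anyone who wants to script that rename, here is a minimal sketch. The function name rename_dedup_output, the ".deduplicated" suffix in the new name, and the file names in the usage line are all assumptions made for illustration; none of this is part of deduplicate_bismark itself, and the actual output file name depends on the deduplication run.

```shell
#!/bin/sh
# Hypothetical helper (not part of Bismark): rename a finished
# deduplication output file so that it starts with a chosen prefix.
rename_dedup_output() {
    src=$1      # file written by the deduplication step (name assumed)
    prefix=$2   # desired prefix for the renamed file
    ext=${src##*.}                        # keep the original extension, e.g. bam or sam
    dest="${prefix}.deduplicated.${ext}"  # naming scheme chosen for this sketch
    # Move the file and report the new name on success.
    mv -- "$src" "$dest" && printf '%s\n' "$dest"
}
```

It could be called as `rename_dedup_output multiple.bam sampleA` once the deduplication is complete, which does the same thing as the plain mv above but keeps the file extension.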

@xiuru

xiuru commented May 18, 2018

Yes, I did it this way. Sorry for my stupid question :)
