Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing about 30% of ZMWs in output #42

Closed
gevro opened this issue Sep 10, 2022 · 2 comments
Closed

Missing about 30% of ZMWs in output #42

gevro opened this issue Sep 10, 2022 · 2 comments

Comments

@gevro
Copy link

gevro commented Sep 10, 2022

Hi,
I'm running the below and found that ~30% of ZMWs are missing from the deepconsensus FASTQ output, even though I see them in the input CCS bam and input subreads BAM:

deepconsensus_0.3.1.sif deepconsensus run --batch_size=1024 --batch_zmws=100 --cpus 4 --max_passes 20 --subreads_to_ccs=subreads.bam --ccs_bam=ccs.bam --checkpoint=/model/checkpoint

Is this expected behavior? Is there any way to see in the logs why many ZMWs are not in the output?

PS: I don't think this is due to deepconsensus output reads having lower quality than the threshold of Q20, because I'm using ccs BAM input with --min-rq=0.99. I know you recommend lower than that, but if anything, inputting ccs BAM with reads > ccs rq 0.99 should not have 30% of reads failing to have a consensus from Deepconsensus.

PPS: I manually ran deepconsensus on the ccs and subreadstoccs of one ZMW that failed to be output by deepconsensus and I got this: failed_quality_filter=1. In CCS, the rq of this ZMW was rq:f:0.994125.
Does Deepconsensus have a more stringent definition of read quality, such that it outputs fewer ZMWs than CCS?

@danielecook
Copy link
Collaborator

By default, DeepConsensus filters reads at >=Q20.

Try running using --min_quality=0.

@pichuan
Copy link
Collaborator

pichuan commented Sep 21, 2022

Hi @gevro , hopefully @danielecook 's answer resolved your issue.
I'm closing this. If you have more questions, please feel free to open another issue or reopen. Thank you!

@pichuan pichuan closed this as completed Sep 21, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants