Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it necessary to further run consensus tools on the results of wtdbg or smartdenovo? #5

Closed
YiweiNiu opened this issue Mar 19, 2018 · 3 comments

Comments

@YiweiNiu
Copy link

Hi Jue,

I'm sorry to bother you once anain.

I found a evaluation paper and it says (paragraph 11 of "Discussion"):

...Wtdbg assemblies, which always ranked last, mostly because no consensus procedure was executed, would need additional rounds of consensus polishing to effectively compete with other assemblers.

So I'm wondering that if it is necessary to further run consensus tools after running wtdbg1.1.006, wtdbg1.2.8 and smartdenovo now, such as Racon? I know all three tools have consensus modules, and have been updated since this paper was published.

I'm working on a de novo genome assemlby project and there are very limited genomic resources to evaluate the correctness. Except PacBio data, I also have several short reads libraries, so I want to perform scaffolding based on results of wtdbg. I don't know how the errors in contigs affect the scaffolding.

Any suggestions or thoughts would be appreciated. Thank you!

Bests,
Yiwei Niu

@ruanjue
Copy link
Owner

ruanjue commented Mar 19, 2018

Thanks for interesting.

The inside consensus tool wtdbg-cns aims to provide a quick way to reduce sequencing errors. It is suggested to use Quiver and/or Pilon to polish the consensus sequences after you feel happy with the assembly. Usually, wtdbg-cns can reduce error rate down to less than 1%, which can be well-aligned by short reads.

BTW, the paper used strage parameters for wtdbg, limited to say wtdbg always ranked last, I had emailed the author just after its publish.

@YiweiNiu
Copy link
Author

Thank you for your reply!

Good to know the error rate of the wtdbg-cns results.

BTW, I know wtdbg and smartdenovo through this paper. After trying several pipelines (Canu, MECAT, miniasm, MaSuRCA, Flye etc.), wtdbg is the fastest and the best (at least the most satisfying N50) so far. I will use the results of wtdbg for downstream analysis.

Thank you for your great tools again!

@ruanjue
Copy link
Owner

ruanjue commented Mar 19, 2018

Have a good luck!

@ruanjue ruanjue closed this as completed Mar 19, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants