New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimisation of parameters #2
Comments
Hi Zac, In fact, wtdbg is degined to be able to assemble a huge genome within one day, SMARTdenovo might get better assemblies in small genomes. Best, |
Thanks for the response Jue, I'll try out your suggestions as soon as I get the chance and I'll let you know how it goes. Zac. |
Hi again, I've managed to finish running a few variations based on your suggestions. The first thing I noticed was that I made a mistake regarding what the final output file was. I was incorrectly using the .ctg.lay.fa file as the final output rather than the .map.fa file. I might suggest changing the readme file to make this clearer, since the .map.fa file isn't mentioned on there. With respect to the outputs, I'll list the commands and statistics here for anyone else interested in using this program. Default
Suggestion 1
Suggestion 1.1
Suggestion 2
Suggestion 2.1
At least for my relatively small genome, the settings of suggestion 1/1.1 seem to be giving the most contiguous assembly which is comparable to SMARTdenovo but not quite as good. The run time is still much faster than SMARTdenovo although the memory usage is much higher when not running at default settings. I'll consider playing around with the program further, but for now I think gene annotation is the next thing I need to try to see how the two programs differ from each other and compare to other assemblers. Zac. |
Hi, Thank you for your discussions about the parameter optimisation. It's very helpful. I've tried to use I want to share my commands and the basic stats I got. smartdenovowith
wtdbg 1.1.006with
wtdbg 1.2.8run1, with defalult
|
Thanks, @YiweiNiu |
Hi Jue, Sorry for the late reply. The test was just finished. with
with
Thank you for your help! Bests, |
In personal feeling, I like wtdbg-1.2.8 more than SMARTdenovo and wtdbg-1.1.006. Anyway, thanks for sharing experiments! |
Do you have any ideas about improving the assembly (though I know this largely depends on the data/species)? If necessary, I will upload the log files. Now I'm not sure what to do next... I've tried several assembly pipelines (miniasm, MECAT, Canu etc.). Thank you! |
N50 contig of ~500kb is enough for following genomic analysis, also enough for further scaffolding. Now, I suggest you to check mis-assemblies, and make a final dicision. |
Thank you very much for your help and for your great tools! I'll try and paste something useful here. |
Can you please clarify how the Thanks! |
a seed = + |
a seed = |
Hi,
I am wondering if you could give me a few tips for optimising the assembly. I have previously used your SMARTdenovo assembler, and received a quite good assembly. Parameters were default, except I reduced minimum length cut-off to 2500. Stats are:
I have tried three different parameter combinations with wtdbg, but haven't been able to get an assembly as contiguous. The default parameters received the following stats:
I next tried two variants of a "maximum sensitivity" combination (at least, as I understand it). The first variant included the arguments "-k 0 -p 17 -S 2 --edge-min 2 --rescue-low-cov-edges". The second was the same, but --tidy-reads was set to 2500. The statistics for these two in their respective order is below:
I was wondering if you had any ideas for how I might be able to improve the program's performance, or if SMARTdenovo might just be better suited for my particular genome?
Thanks,
Zac.
The text was updated successfully, but these errors were encountered: