-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
classify-consensus-blast: expose num_threads #155
Comments
please someone implement this, our jobs are taking forever against SILVA due to blast being single threaded. I would do the edit myself if I knew enough of python (sob) |
Note: @cherman2 is going to work on this issue. |
I see that in past code, there was a num_thread that was set to 1 (num_threads: str=1), in the current version the numThread variable does not exist. I can see in the history that it was removed because "The 'num_threads' parameter had no effect with the mode that 'blastn' is invoked in." Should num_threads be reimplemented to equal 1, and be exposed, or should num_threads be an amount that is decided by the user? If so, what is the max amount of threads the code should allow? |
Awesome, thanks for the sleuthing, @cherman2! 🔍 Based on the issue/PR you linked to (#77), it is safe to assume that this issue (#155) is not actually something that can be accomplished (sorry @splaisan, our hands are tied), since What do you think about pivoting this issue and turning it into a documentation issue? Specifically, would you be able to add a comment to the source code where the Please let me know if you have any questions, or if I can clarify anything for you. Thanks! PS - I confirmed that blastn 2.6.0 (which we are currently using in q2-feature-classifier) still emits this warning |
Here is my documentation for why issue qiime2#155(qiime2#155) is not accomplishable at this time. Issue qiime2#77 (qiime2#77) was fixed by removing -num_threads as it was ignored when subject was specified. Since num_threads is currently not a variable it can not be exposed.
@gregcaporaso and @thermokarst. I created a fork of _blast.py with my proposed documentation of the issue. Let me know if there is anything else I need to do on my end! |
Thanks @cherman2! Please open up a GitHub Pull Request (second ref) to get your changes ready for review. Thanks! |
Done! @thermokarst |
Hi, is there any chance you could wrap makeblastdb, so that classify-consensus-blast first creates a blast-indexed database starting from the fasta file included in the FeatureData[Sequence] artifact, and then performs the alignment (specifying -db instead of -subject), so that it can be run with multi-threading? |
Closed in #157 |
Improvement Description
Expose num_threads to enable parallelization in classify-consensus-blast
Current Behavior
num_threads is not exposed.
Proposed Behavior
Expose it.
The text was updated successfully, but these errors were encountered: