-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bismark_methylation_extractor multicore option #52
Comments
Hi Albert, the multi-core option works on extracting the methylation information from the BAM file. The subsequent bedGraph sorting and cytosine report step do in fact run on a single core. It would appear that your status report is generated at the cytosine report stage, and 13% might be just about 1 core at 100% on an 8 core machine, would that make sense? This step iterates through the genomic sequence, identifies every C and looks up whether or not the C was found in the coverage file. I theory this could be made more parallel, e.g. by doing it for each chromosome separately, but we did't find it very pressing to work on the efficiency of this step. |
Gotcha. So I expect 100% usage in the bam reading step, then after that, 1 Definitely going to create a new feature request for this ;-) On Fri, Jul 1, 2016 at 9:31 AM, FelixKrueger notifications@github.com
|
Considering this question resolved. |
Hi,
I am running bismark_methylation_extractor on a machine instance with 8 CPUs and 15GB of RAM. I set it up so that it runs with
--multicore 8
option and 10G of RAM for sorting.I am looking at the logs (see below), and it's only using 13% CPU. Could it be I am missing a parameter here? Is there a way to make it faster or more efficient?
The text was updated successfully, but these errors were encountered: