FelixKrueger / Bismark Public
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
no coverage file produced #403
Comments
|
Hi @imerelli I really can't say why this isn't working, I have slightly modified the command and ran it over here: And it runs through to the coverage step just fine: This is also using version 0.23.0. So in conclusion, it seems to be something on your side, but I am not sure what it could be? Could you just take a small number of lines from the initial BAM file (say 1000 lines), and see that works? Maybe drop some of the options, such as |
|
Hi Felix, |
|
The number of cores is only relevant for the extraction part (which seems to have been completed fine in your case). More cores should generally mean it will be faster. The bedGraph conversion step is run on a single core and is indeed memory dependent, and will run with a default of 2GB, and cache the rest out to disk (the folder you specified with You should be able to start the bedGraph conversion directly, starting from the CpG* files, just type |
|
Ok, I will go on with the tests and let you know. Is there a way, starting from the CpG* files, to generate the coverage files? |
|
yes indeed, e.g.:
|
|
Ok, I get it. This command makes both the .bedgraph.gz and the .cov.gz files. |
|
Hi Felix, it was a memory problem. Increasing the buffer to 10G the problem disappeared (both computing directly with bismark_methylation_extractor and using bismark2bedGraph). Nonetheless, it would be interesting to understand why switching from memory to disk in sorting files (if I get correctly) gives problems. |
|
Excellent, glad this is now working. If you ever find out more, feel free to post it here. |
H Felix,
while in previously analysis with bismark_methylation_extractor (v0.20.0) using a command line such as:
bismark_methylation_extractor -p -o /analysis/results/ --gzip --bedGraph --multicore 4 ETRs_1_deduplicated.bam
the cov file was always generated (which is essential in order to use R/methylkit for the statistical analysis), now using the same command line with bismark_methylation_extractor (v0.23.0) the coverage file in not generated anymore. The computation seems ok, since all the other files are present (CpG_OT, CpG_OB, CHH_OT, CHH_OB, CHG_OB) and also the .M-bias.txt and splitting_report.txt seems fine. Below the end of the log, which also looks good.
Any hints of what is happening?
C methylated in CpG context: 73.4%
C methylated in CHG context: 0.6%
C methylated in CHH context: 0.5%
Merging individual M-bias reports into overall M-bias statistics from these 4 individual files:
ETRs_1_deduplicated_splitting_report.txt.1.mbias
ETRs_1_deduplicated_splitting_report.txt.2.mbias
ETRs_1_deduplicated_splitting_report.txt.3.mbias
ETRs_1_deduplicated_splitting_report.txt.4.mbias
Determining maximum read lengths for M-Bias plots
Maximum read length of Read 1: 101
Maximum read length of Read 2: 96
Perl module GD::Graph::lines is not installed, skipping drawing M-bias plots (only writing out M-bias plot table)
Determining maximum read lengths for M-Bias plots
Maximum read length of Read 1: 101
Maximum read length of Read 2: 96
Perl module GD::Graph::lines is not installed, skipping drawing M-bias plots (only writing out M-bias plot table)
Deleting unused files ...
CpG_OT_ETRs_1_deduplicated.txt.gz contains data -> kept
CpG_CTOT_ETRs_1_deduplicated.txt.gz was empty -> deleted
CpG_CTOB_ETRs_1_deduplicated.txt.gz was empty -> deleted
CpG_OB_ETRs_1_deduplicated.txt.gz contains data -> kept
CHG_OT_ETRs_1_deduplicated.txt.gz contains data -> kept
CHG_CTOT_ETRs_1_deduplicated.txt.gz was empty -> deleted
CHG_CTOB_ETRs_1_deduplicated.txt.gz was empty -> deleted
CHG_OB_ETRs_1_deduplicated.txt.gz contains data -> kept
CHH_OT_ETRs_1_deduplicated.txt.gz contains data -> kept
CHH_CTOT_ETRs_1_deduplicated.txt.gz was empty -> deleted
CHH_CTOB_ETRs_1_deduplicated.txt.gz was empty -> deleted
CHH_OB_ETRs_1_deduplicated.txt.gz contains data -> kept
The text was updated successfully, but these errors were encountered: