Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bamCoverage fails with large/merged files #1195

Open
kthorner opened this issue Feb 23, 2023 · 0 comments
Open

bamCoverage fails with large/merged files #1195

kthorner opened this issue Feb 23, 2023 · 0 comments

Comments

@kthorner
Copy link

Hi,

My issue is somewhat related to #662

Have tried multiple versions, most recent being 3.4.3 with Python 3.8.5

bamCoverage works the majority of the time for my pipeline. But the bam files can be "large", which is subjective but they are from merged FASTQs/bams and without filtering can be ~20 GB or more. This is not standard human or mouse but there is nothing crazy with the number of scaffolds, and contigs seem fine.

I use the following depending on whether it's RNA/DNA (these specific parameters have been in use for a while and I would like to avoid modifying for consistency if possible):

    bamCoverage --bam combined.bam -o ${BIGWIG} --binSize 20 --normalizeUsing RPGC --effectiveGenomeSize ${params.gsize_trop}  --extendReads 200 --smoothLength 60 --numberOfProcessors 4 --verbose

    bamCoverage --bam combined.bam -o ${BIGWIG} --effectiveGenomeSize ${params.gsize_laevis} --normalizeUsing BPM --binSize 20 --smoothLength 60 --numberOfProcessors 4 --verbose

Appears that the scaling factor is calculated and processing is occurring, but 24 hr+ can go by with no output. I work on a HPC and can allocate as much CPUs/RAM as necessary but nothing changes. I have seen that it is directly related to the size of the files, AKA from the same experiment the smaller ones can be processed. I have also found that downsampling with samtools can indirectly fix my problem, but I'd like to know the underlying cause. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant