-
Notifications
You must be signed in to change notification settings - Fork 708
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Too many input files for MultiQC #100
Comments
@pditommaso - have you come across problems like this before? I guess that this is because the MultiQC process is softlinking in a lot of files which makes |
Ouch, 11Mb of input files! You can mitigate this problem using an directory as output instead files. I mean, instead of having output:
file "*_fastqc.{zip,html}" into fastqc_results let multiqc to save the files into a directory e.g. output:
file "reports" into fastqc_results |
Yes, maybe we should profile how many files each channel going into MultiQC has. I suspect that there are quite a few that aren't needed. For example - MultiQC only needs the zip file here, not the html. So could make new MultiQC-specific channels that have just these files to cut down on the number. |
I'm wondering whether @olgabot had issues with this when doing her large-scale nf-core/rnaseq experiments on AWS - any ideas? |
i ran the RNAseq pipeline on 576 fastq files and the slurm submission has also failed on the multiqc process with the same error: |
FYI: A user just encountered the same error in nf-core/eager when trying to run a 1000 sample job. If I understand the solution proposed above, in this case I don't think the directory output would necessarily work as most of the log files in this case are standalone from separate processes (rather than lots of logs from a single process). |
Had that some days ago and opened nextflow-io/nextflow#2118 for some points |
Just for the record, we've also had this issue now with |
Re the nf-core/airrflow issue @ggabernet just mentioned. I can confirm the |
The issue at Nextflow is still open, the small scale mitigation attempts did also not help us permanently either: Maybe also comment here too to make sure this gets addressed soon 👉🏻 nextflow-io/nextflow#2852 |
same issue on |
Should be better when using nextflow-io/nextflow#2852 |
I ran the RNA-seq pipeline on 360 samples, and the slurm submission of multiQC failed with
Pathname of a file, directory or other parameter too long
The files
.command.stub
and.command.sh
look normal, but.command.run
is 11Mb, with many commands forln
etc. So it might be something related to this bug: https://bugs.schedmd.com/show_bug.cgi?id=2198The text was updated successfully, but these errors were encountered: