Generate reports per run, per project and per lane #13

Aratz · 2024-03-28T16:00:34Z

This PR introduces MultiQC report generation by lane, by rundir and by sample group.

Closes #3

PR checklist

Issue with the previous implementation was that sometimes MULTIQC_PER_LANE would execute before the extra files were collected into `ch_multiqc_extra_files`, causing `null` to be added to the list of files passed to multiqc.

mahesh-panchal

This looks like what I was suggesting 👍🏽

docs/usage.md

mahesh-panchal · 2024-05-14T12:13:16Z

subworkflows/local/utils_nfcore_seqinspector_pipeline/main.nf

@@ -84,7 +84,7 @@ workflow PIPELINE_INITIALISATION {
        .fromSamplesheet("input") // Validates samplesheet against $projectDir/assets/schema_input.json. Path to validation schema is defined by $projectDir/nextflow_schema.json
        .map {
            meta, fastq_1, fastq_2 ->
-                def id_string = "${meta.sample}_${meta.project ?: "ungrouped"}_${meta.lane}"
+                def id_string = "${meta.sample}_${meta.group ?: "ungrouped"}_${meta.lane}"


Doesn't lane need a default value too if it's not required?

It's been removed from required.

😳 Have I then been reviewing the wrong/outdated version of this PR all the time? Because I ran gh pr 13 checkout and it is still in there for me locally ?!?

I removed it so we would be able to run on sequencing platforms without lanes, e.g. ONT.

Sure, but what about the other paths, e.g. channel where meta.group has a setting, but meta.lane has nothing (and the filter is on meta.group)?
The name will include null in it.

Could you point out where this would be an issue?

I'll note that I don't mind re-working this code into something more explicit, I simply lack the know-how as of now 😆

I guess the question is, is id used for anything important: For example with the Promethion test, the csv looks like:

sample,lane,group,fastq_1,fastq_2,rundir hg001,,r10p41_e8p2_human_runs_jkw,https://github.com/nf-core/test-datasets/raw/seqinspector/testdata/PromethION/20230505_1857_1B_PAO99309_94e07fab/fastq_pass/PAO99309_pass__94e07fab_c3641428_1.fastq.gz,,

and then the id string should be: hg001_r10p41_e8p2_human_runs_jkw_null. Does it matter that this is the case?
When you're grouping the files by group r10p41_e8p2_human_runs_jkw I guess lane information is not needed at all downstream of this?

Thanks for clarifying! From what I remember of the initial meeting, the id was intended as simply a concatenation of fields to ensure uniqueness within the pipeline run. This is still ensured even if some of the fields are null, right?

Intuitively I think having a consistent way to generate the id that sometimes contains null is preferable to setting up different conventions for generating it across different sequencing platforms.

Wouldn't it be easier to use the user-provided sample column? Could be potentially combined with a short uuid or hash to be unique in case we have samples that extend over multiple input files?

workflows/seqinspector.nf

MatthiasZepper

Great work and interesting proposal for the channel architecture. I have a different pattern in mind, but will need to try out first, if it works.

main.nf

subworkflows/local/utils_nfcore_seqinspector_pipeline/main.nf

@@ -84,7 +84,7 @@ workflow PIPELINE_INITIALISATION {
        .fromSamplesheet("input") // Validates samplesheet against $projectDir/assets/schema_input.json. Path to validation schema is defined by $projectDir/nextflow_schema.json
        .map {
            meta, fastq_1, fastq_2 ->
-                def id_string = "${meta.sample}_${meta.project ?: "ungrouped"}_${meta.lane}"
+                def id_string = "${meta.sample}_${meta.group ?: "ungrouped"}_${meta.lane}"


subworkflows/local/utils_nfcore_seqinspector_pipeline/main.nf

tests/MiSeq.main.nf.test

tests/PromethION.main.nf.test

docs/usage.md

workflows/seqinspector.nf

MatthiasZepper · 2024-05-15T16:23:53Z

workflows/seqinspector.nf

This is an impressive work. Conceptually, I wonder if another channel architecture could simplify usage. But I will need to experiment first to see if that idea would work in the first place. Hence, only the minor remarks here first.

Because those .filter() and .multiMap() chains seemed unnecessarily complex, I experimented with an own solution (including the .cross() operator, which I then dropped again), but ultimately the differences aren't large. Like you, I ended up moving the grouping variables out of the meta map and grouped over them.

Most of what makes your solution seemingly more complicated is the juggling with MultiQC config files, which I have not included in my minimal example, so it is an unfair competition. On the plus side, I have only now understood your approach, I believe :-)

#!/usr/bin/env nextflow workflow { ch_samplesheet = Channel.of( [['sample':'SampleA', 'group':'S1', 'lane':'1' ], ['/nf-core/test-datasets/raw/seqinspector/testdata/NovaSeq6000/200624_A00834_0183_BHMTFYDRXX/Sample1_S1_L001_R1_001.fastq.gz']], [['sample':'SampleB', 'group':'S1', 'lane':'2'], ['/nf-core/test-datasets/raw/seqinspector/testdata/NovaSeq6000/200624_A00834_0183_BHMTFYDRXX/SampleA_S2_L001_R1_001.fastq.gz']], [['sample':'SampleC', 'group':'S2', 'lane':'1'], ['/nf-core/test-datasets/raw/seqinspector/testdata/NovaSeq6000/200624_A00834_0183_BHMTFYDRXX/Sample23_S3_L001_R1_001.fastq.gz']], [['sample':'SampleD', 'group':'S2', 'lane':'1'], ['/nf-core/test-datasets/raw/seqinspector/testdata/NovaSeq6000/200624_A00834_0183_BHMTFYDRXX/sampletest_S4_L001_R1_001.fastq.gz']], [['sample':'Undetermined', 'group':null, 'lane':'1'], ['/nf-core/test-datasets/raw/seqinspector/testdata/NovaSeq6000/200624_A00834_0183_BHMTFYDRXX/Undetermined_S0_L001_R1_001.fastq.gz']] ) // ------------------------------------------------------ // Apply the various QC Tools to that same input channel // ------------------------------------------------------ ch_qc_outputs = some_qc_tool(ch_samplesheet) // Mix everything together: No problem as long as the meta remains intact // I tried join here, but it does not tolerate null values apparently. ch_qc_outputs = ch_qc_outputs.mix(some_other_qc_tool(ch_samplesheet)) // ---------------------------------------------------------------------------------------------------------------------------------- // At the very end, we move the grouping variables to the front and groupTuples based on combination of all possible grouping levels // ---------------------------------------------------------------------------------------------------------------------------------- // we also simplify the meta to the sample name, the only thing still needed. ch_qc_outputs_final = ch_qc_outputs.map{ meta, sample -> [ "${meta.group}", "${meta.lane}", ["${meta.sample}", sample]]}.groupTuple(by: [0,1]) // ----------------------------------------------------------------------------------------------- // Group again to the desired level (e.g. lanes) // ----------------------------------------------------------------------------------------------- ch_qc_outputs_lane_subsets = ch_qc_outputs_final.groupTuple(by: [1]) ch_qc_outputs_group_subsets = ch_qc_outputs_final.groupTuple(by: [0]) } process some_qc_tool { input: tuple val(meta), path(fastq) output: tuple val(meta), path("*.log"), emit: qc script: """ echo "QC of $fastq" > qc_stats.log """ } process some_other_qc_tool { input: tuple val(meta), path(fastq) output: tuple val(meta), path("*.log"), emit: qc script: """ echo "QC of $fastq" > qc_stats.log """ }

Co-authored-by: Matthias Zepper <6963520+MatthiasZepper@users.noreply.github.com>

github-actions · 2024-05-23T08:55:13Z

`nf-core lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit 02affeb

+| ✅ 177 tests passed       |+
!| ❗  21 tests had warnings |!

❗ Test warnings:

readme - README contains the placeholder zenodo.XXXXXXX. This should be replaced with the zenodo doi (after the first release).
pipeline_todos - TODO string in README.md: TODO nf-core:
pipeline_todos - TODO string in README.md: Include a figure that guides the user through the major workflow steps. Many nf-core
pipeline_todos - TODO string in README.md: Fill in short bullet-pointed list of the default steps in the pipeline
pipeline_todos - TODO string in README.md: Add citation for pipeline after first release. Uncomment lines below and update Zenodo doi and badge at the top of this file.
pipeline_todos - TODO string in README.md: Add bibliography of tools and data used in your pipeline
pipeline_todos - TODO string in nextflow.config: Specify your pipeline's command line flags
pipeline_todos - TODO string in main.nf: Remove this line if you don't need a FASTA file
pipeline_todos - TODO string in usage.md: Add documentation about anything specific to running your pipeline. For general topics, please point to (and add to) the main nf-core website.
pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!
pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your preferred methods description, e.g. add publication citation for this pipeline
pipeline_todos - TODO string in ci.yml: You can customise CI pipeline run tests as required
pipeline_todos - TODO string in awsfulltest.yml: You can customise AWS full pipeline tests as required
pipeline_todos - TODO string in test.config: Specify the paths to your test data on nf-core/test-datasets
pipeline_todos - TODO string in test.config: Give any required params for the test so that command line flags are not needed
pipeline_todos - TODO string in test_full.config: Specify the paths to your full test data ( on nf-core/test-datasets or directly in repositories, e.g. SRA)
pipeline_todos - TODO string in test_full.config: Give any required params for the test so that command line flags are not needed
pipeline_todos - TODO string in base.config: Check the defaults for all processes
pipeline_todos - TODO string in base.config: Customise requirements for specific processes.

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .editorconfig
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/ci.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-seqinspector_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-seqinspector_logo_light.png
files_exist - File found: docs/images/nf-core-seqinspector_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: main.nf
files_exist - File found: assets/multiqc_config.yml
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: .github/workflows/awstest.yml
files_exist - File found: .github/workflows/awsfulltest.yml
files_exist - File found: modules.json
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: docs/images/nf-core-seqinspector_logo.png
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/NfcoreTemplate.groovy
files_exist - File not found check: lib/Utils.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: lib/WorkflowMain.groovy
files_exist - File not found check: lib/WorkflowSeqinspector.groovy
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: Singularity
files_exist - File not found check: lib/nfcore_external_java_deps.jar
files_exist - File not found check: .travis.yml
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: params.validationShowHiddenParams
nextflow_config - Config variable found: params.validationSchemaIgnoreParams
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config manifest.version ends in dev: 1.0dev
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
nextflow_config - nextflow.config contains configuration profile test
nextflow_config - Config default value correct: params.custom_config_version= master
nextflow_config - Config default value correct: params.custom_config_base= https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Config default value correct: params.max_cpus= 16
nextflow_config - Config default value correct: params.max_memory= 128.GB
nextflow_config - Config default value correct: params.max_time= 240.h
nextflow_config - Config default value correct: params.publish_dir_mode= copy
nextflow_config - Config default value correct: params.max_multiqc_email_size= 25.MB
nextflow_config - Config default value correct: params.validate_params= true
nextflow_config - Config default value correct: params.pipelines_testdata_base_path= https://raw.githubusercontent.com/nf-core/test-datasets/
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/CONTRIBUTING.md matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/PULL_REQUEST_TEMPLATE.md matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .github/workflows/linting.yml matches the template
files_unchanged - assets/email_template.html matches the template
files_unchanged - assets/email_template.txt matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-seqinspector_logo_light.png matches the template
files_unchanged - docs/images/nf-core-seqinspector_logo_light.png matches the template
files_unchanged - docs/images/nf-core-seqinspector_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
actions_ci - '.github/workflows/ci.yml' is triggered on expected events
actions_ci - '.github/workflows/ci.yml' checks minimum NF version
actions_awstest - '.github/workflows/awstest.yml' is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml does not use -profile test
readme - README Nextflow minimum version badge matched config. Badge: 23.04.0, Config: 23.04.0
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (103 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
system_exit - No System.exit calls found
actions_schema_validation - Workflow validation passed: branch.yml
actions_schema_validation - Workflow validation passed: ci.yml
actions_schema_validation - Workflow validation passed: awsfulltest.yml
actions_schema_validation - Workflow validation passed: fix-linting.yml
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: download_pipeline.yml
actions_schema_validation - Workflow validation passed: release-announcements.yml
actions_schema_validation - Workflow validation passed: clean-up.yml
actions_schema_validation - Workflow validation passed: awstest.yml
actions_schema_validation - Workflow validation passed: linting_comment.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
multiqc_config - assets/multiqc_config.yml found and not ignored.
multiqc_config - assets/multiqc_config.yml contains report_section_order
multiqc_config - assets/multiqc_config.yml contains export_plots
multiqc_config - assets/multiqc_config.yml contains report_comment
multiqc_config - assets/multiqc_config.yml follows the ordering scheme of the minimally required plugins.
multiqc_config - assets/multiqc_config.yml contains a matching 'report_comment'.
multiqc_config - assets/multiqc_config.yml contains 'export_plots: true'.
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'
base_config - conf/base.config found and not ignored.
modules_config - conf/modules.config found and not ignored.
modules_config - FASTQC found in conf/modules.config and Nextflow scripts.
modules_config - MULTIQC_GLOBAL found in conf/modules.config and Nextflow scripts.
modules_config - MULTIQC_PER_LANE found in conf/modules.config and Nextflow scripts.
modules_config - MULTIQC_PER_GROUP found in conf/modules.config and Nextflow scripts.
modules_config - MULTIQC_PER_RUNDIR found in conf/modules.config and Nextflow scripts.
nfcore_yml - Repository type in .nf-core.yml is valid: pipeline
nfcore_yml - nf-core version in .nf-core.yml is set to the latest version: 2.14.1

Run details

nf-core/tools version 2.14.1
Run at 2024-05-30 11:22:09

kedhammar · 2024-05-23T09:04:56Z

I've tried to clean up the discussion thread and have pushed some additional commits to address simple issues. Requesting re-reviews.

MatthiasZepper

I still have a few open questions, but I also think that could be changed in a subsequent refactor if desired. Thus, I suggest merging and tackle that later?

conf/modules.config

MatthiasZepper · 2024-05-29T11:52:53Z

docs/usage.md

-TREATMENT_REP2,AEG588A5_S5_L003_R1_001.fastq.gz,
-TREATMENT_REP3,AEG588A6_S6_L003_R1_001.fastq.gz,
-TREATMENT_REP3,AEG588A6_S6_L004_R1_001.fastq.gz,
+sample  lane  group   fastq_1                                       fastq_2 rundir


Do you think that the order of the columns is advisable like this? Intuitively, I would have put all categorical variables together at the end, so that additional columns can be added easily later, if required e.g. by other sequencing technologies.

I don't have a strong opinion on this, and I feel this deserves to be discussed in a new issue/pr. This pr was not meant to change the input format. This commit to usage.md just fixes the documentation so that it's up to date with what the format actually is.

MatthiasZepper · 2024-05-29T11:59:59Z

docs/usage.md

-CONTROL_REP1,AEG588A1_S1_L003_R1_001.fastq.gz,AEG588A1_S1_L003_R2_001.fastq.gz
-CONTROL_REP1,AEG588A1_S1_L004_R1_001.fastq.gz,AEG588A1_S1_L004_R2_001.fastq.gz
+```
+run_dir


Why did you replace the exemplary filenames with this synthetic example? I think this may lead to confusion, because it may prompt people to tediously rename their files prior to a run. We should make clear that seqinspector takes the relevant information from the columns in the sample sheet and not suggest that the file names matter.

Also, judgemental adjectives like "simple" (or "difficult" etc.) should ideally be avoided in a README.

@kedhammar ?

Since @mahesh-panchal requested we visualize the directory structure, I thought it would be easier to connect the dots to the example samplesheet if all the file names in the dir also contained the information shown in the samplesheet.

I don't necessarily get the impression we are suggesting the files need to follow a particular naming convention by showing an example that is as informative as possible, but I don't feel too strongly about it.

MatthiasZepper · 2024-05-29T12:17:43Z

subworkflows/local/utils_nfcore_seqinspector_pipeline/main.nf

@@ -84,7 +84,7 @@ workflow PIPELINE_INITIALISATION {
        .fromSamplesheet("input") // Validates samplesheet against $projectDir/assets/schema_input.json. Path to validation schema is defined by $projectDir/nextflow_schema.json
        .map {
            meta, fastq_1, fastq_2 ->
-                def id_string = "${meta.sample}_${meta.project ?: "ungrouped"}_${meta.lane}"
+                def id_string = "${meta.sample}_${meta.group ?: "ungrouped"}_${meta.lane}"


Wouldn't it be easier to use the user-provided sample column? Could be potentially combined with a short uuid or hash to be unique in case we have samples that extend over multiple input files?

MatthiasZepper · 2024-05-29T13:53:36Z

workflows/seqinspector.nf

+    )
+
+    // Generate reports by group
+    multiqc_extra_files_per_group = ch_multiqc_files


The purpose and construction of the extra_files channel(s) still remains elusive to me.

Firstly, it is constructed from the ch_multiqc_files channel, that into which all the output files from the QC tools are mixed. This means it is a rather large channel that then needs to be filtered and mapped. If the only purpose is to get all group levels, I would prefer to start from the ch_samplesheet, which already should comprise all relevant information.

Secondly, all the files in ch_multiqc_extra_files (as of now) are not specific to the generated MultiQC report. ch_workflow_summary, ch_multiqc_custom_methods_description and ch_collated_versions are all global. Thus, I fail to see why this channel needs to be constructed for every grouping level instead of being reused for all MultiQC processes.

Correct me if I'm wrong, but when a process or workflow (in this case MULTIQC) is run on two or more queue channels, it will try to zip them and run on each pair of values.

If you don't duplicate the multiqc extra file to follow the same grouping as in the first channel, nextflow will run the lane 1 files against the first multiqc extra file, then the lane 2 files against the second multiqc extra file and so on.

This was very hard to get right, I'm all ears if you see a better way to do it :)

I realized I confused myself with my explanation (which for sure is sign that this should be simplified). Let me try again 😅

When the files are provided to MULTIQC_BY_LANE, lane_mqc_files.samples_per_lane looks like this: [list of samples for lane 1], [list of samples for lane 2], .... The extra multiqc files are needed for each report and need to be included in each of these lists. I thought the best solution would be to use a map over these lists and append the extra files each time, but I could never got that to work.

If you find a better way to perform this operation, I'm all for it :)

MatthiasZepper · 2024-05-29T14:12:40Z

workflows/seqinspector.nf

+        .map { meta, sample -> [ "[GROUP:${meta.group}]", meta, sample ] }
+        .groupTuple()
+        .tap { mqc_by_group }
+        .collectFile{


Is it really required to construct a custom MultiQC config just to set the output file paths? I somehow think that it should be possible to handle that in the publishDir of the module.config? Or am I missing something?

The issue here is that if you run the MultiQC module once for each lane without specifying different configs each time, it will create files with the same name regardless the lane number. Since the filename is all you have to play with when setting publishDir, it becomes very hard to sort them out into different folders.

maxulysse · 2024-05-30T08:14:42Z

README.md

 First, prepare a samplesheet with your input data that looks as follows:

 `samplesheet.csv`:

 ```csv
-sample,fastq_1,fastq_2
-CONTROL_REP1,AEG588A1_S1_L002_R1_001.fastq.gz,AEG588A1_S1_L002_R2_001.fastq.gz
+sample,lane,group,fastq_1,fastq_2,rundir


what about an extra "individual" field for when you have multiple samples from the same patient (thinking cancer sample sarek style)?

or is this what you mean by group?

Aratz · 2024-05-30T11:33:49Z

I'll merge this now, thank you all for your reviews and comments. There are still some open discussions that I think are worth addressing but are not critical to this feature, we can keep discussing them here and in subsequent PRs.

Aratz added 3 commits March 28, 2024 16:14

Generate reports per lane, group and rundir

a31040e

Improve formatting

0da5870

Improve output sorting

d233d8f

Aratz self-assigned this Mar 28, 2024

This comment was marked as outdated.

Sign in to view

Aratz added 3 commits April 8, 2024 14:13

Use group instead of project

307e43c

Fix output channel

627cf94

Fix linting

c9ba028

This comment was marked as outdated.

Sign in to view

Aratz changed the base branch from master to dev April 8, 2024 13:30

Aratz added 2 commits April 8, 2024 15:52

Give credits back to NGI

2cfc91d

Fix file names

fbfb02d

This comment was marked as resolved.

Sign in to view

Aratz and others added 16 commits May 3, 2024 09:50

Set up tests

8a19929

point test conf upstream

eba628e

project -> group

23f69d1

project -> group

1ebf3f1

make lane non-compulsory

b0bf471

remove unused file

9e0eca3

revamp nf-test, run once for each sequencing platform

98e60bf

Update modules and subworkflows

d95c660

Fix multiqc extra files

e0527cc

Issue with the previous implementation was that sometimes MULTIQC_PER_LANE would execute before the extra files were collected into `ch_multiqc_extra_files`, causing `null` to be added to the list of files passed to multiqc.

Add test snapshots

42159a1

Remove unused module configuration

ef61f9f

Update usage docs and restore example samplesheet

51b01e9

Update output docs

7c7f31f

Update changelog

1c4f6e0

Update samplesheet in readme file

211bfaf

Merge remote-tracking branch 'origin/dev' into multiqc_multireport

acc82d0

Aratz marked this pull request as ready for review May 14, 2024 10:51

Aratz requested review from MatthiasZepper, mahesh-panchal and kedhammar May 14, 2024 10:52

mahesh-panchal reviewed May 14, 2024

View reviewed changes

MatthiasZepper reviewed May 15, 2024

View reviewed changes

This comment was marked as resolved.

Sign in to view

kedhammar and others added 3 commits May 16, 2024 16:00

Use testdata base path param in tests/MiSeq.main.nf.test

2303db9

Co-authored-by: Matthias Zepper <6963520+MatthiasZepper@users.noreply.github.com>

Use testdata base path param in tests/NovaSeq6000.main.nf.test

1ea8ac0

Co-authored-by: Matthias Zepper <6963520+MatthiasZepper@users.noreply.github.com>

Use testdata base path param in tests/PromethION.main.nf.test

048765f

Co-authored-by: Matthias Zepper <6963520+MatthiasZepper@users.noreply.github.com>

This comment was marked as resolved.

Sign in to view

MatthiasZepper and others added 6 commits May 17, 2024 11:38

Make the tests work with 'pipelines_testdata_base_path' parameter.

4329bb9

update snapshots, add nf-test.log to gitignore

3ff8503

visualize example run dir corresponsing to samplesheet

8ac4d76

naming fixes

84c9b3d

nf-core sync

aaf17b6

nf-core fixes

e6dfea9

kedhammar requested review from mahesh-panchal and MatthiasZepper May 23, 2024 09:05

kedhammar approved these changes May 23, 2024

View reviewed changes

MatthiasZepper approved these changes May 29, 2024

View reviewed changes

mahesh-panchal approved these changes May 30, 2024

View reviewed changes

maxulysse reviewed May 30, 2024

View reviewed changes

Improve publishDir logic

02affeb

Aratz merged commit 9cb1d68 into nf-core:dev May 30, 2024
4 checks passed

Generate reports per run, per project and per lane #13

Generate reports per run, per project and per lane #13

Conversation

Aratz commented Mar 28, 2024 • edited Loading

PR checklist

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as resolved.

This comment was marked as resolved.

mahesh-panchal left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

This comment was marked as resolved.

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MatthiasZepper left a comment

Choose a reason for hiding this comment

This comment was marked as resolved.

Choose a reason for hiding this comment

Choose a reason for hiding this comment

This comment was marked as resolved.

This comment was marked as resolved.

This comment was marked as resolved.

This comment was marked as resolved.

github-actions bot commented May 23, 2024 • edited Loading

nf-core lint overall result: Passed ✅ ⚠️

❗ Test warnings:

✅ Tests passed:

Run details

kedhammar commented May 23, 2024

MatthiasZepper left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Aratz commented May 30, 2024

Aratz commented Mar 28, 2024 •

edited

Loading

github-actions bot commented May 23, 2024 •

edited

Loading

`nf-core lint` overall result: Passed ✅ ⚠️