
Continue analysis even when individual files fail the filtering threshold #641

Merged: 6 commits into nf-core:dev from sminot:skip_missing_data on Sep 28, 2023

Conversation

@sminot (Contributor) commented on Sep 26, 2023

This PR addresses two problems that arise when samples fail to pass the filtering threshold:

  1. DADA2_FILTANDTRIM raises an error because no output files are written, and
  2. no downstream outputs are produced because the sample's id element no longer matches in subsequent join steps.

I've added a test case to reproduce the bug, and the code I've added should address it.

This should be a better solution than the one outlined in #638 (which I'm now closing).
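To illustrate the shape of the fix in Nextflow terms (a rough sketch with illustrative process and file names, not the exact pipeline code): declaring the process output as `optional: true` lets the task succeed even when a sample produces no filtered reads.

```nextflow
process DADA2_FILTANDTRIM {
    input:
    tuple val(meta), path(reads)

    output:
    // optional: true lets the task succeed even when every read of a
    // sample is discarded by the filter and no output file is written
    tuple val(meta), path("*.filt.fastq.gz"), optional: true, emit: reads

    script:
    """
    # placeholder for the actual DADA2 filterAndTrim invocation
    filter_and_trim.R --input ${reads} --prefix ${meta.id}
    """
}
```

With an optional output, failed samples simply never appear in the output channel, so downstream steps only ever see ids that actually have data.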

PR checklist

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
  • If you've added a new tool - have you followed the pipeline conventions in the contribution docs
  • If necessary, also make a PR on the nf-core/ampliseq branch on the nf-core/test-datasets repository.
  • Make sure your code lints (nf-core lint).
  • Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
  • Usage Documentation in docs/usage.md is updated.
  • Output Documentation in docs/output.md is updated.
  • CHANGELOG.md is updated.
  • README.md is updated (including new tool citations and authors/contributors).

@sminot (Contributor, Author) commented on Sep 26, 2023

Test data in nf-core/test-datasets#1008

@github-actions bot commented on Sep 26, 2023

nf-core lint overall result: Failed ❌

Posted for pipeline commit 8ca115e

```diff
+| ✅ 148 tests passed       |+
#| ❔   3 tests were ignored |#
!| ❗   2 tests had warnings |!
-| ❌   5 tests failed       |-
```

❌ Test failures:

  • files_unchanged - CODE_OF_CONDUCT.md does not match the template
  • files_unchanged - .github/CONTRIBUTING.md does not match the template
  • files_unchanged - .github/workflows/linting.yml does not match the template
  • files_unchanged - lib/NfcoreTemplate.groovy does not match the template
  • multiqc_config - 'assets/multiqc_config.yml' does not contain a matching 'report_comment'.

❗ Test warnings:

  • readme - README did not have a Nextflow minimum version badge.
  • schema_lint - Parameter input is not defined in the correct subschema (input_output_options)

❔ Tests ignored:

✅ Tests passed:

Run details

  • nf-core/tools version 2.10
  • Run at 2023-09-27 17:57:10

@d4straub (Collaborator) commented

Thanks! That indeed looks elegant; I'm going to test it today and give feedback.

@d4straub (Collaborator) left a review


Awesome, thanks! That was a good idea to change the output of dada2_filtntrim in that way.

The metadata, however, makes test_failed fail for me; that is easily fixable, though.

The changelog could also get an update, and you can add yourself to the contributors in the README credits section as well if you want.

Review threads (all resolved):

  • subworkflows/local/dada2_preprocessing.nf (outdated)
  • conf/test_failed.config (outdated)
  • nextflow.config
@sminot (Contributor, Author) commented on Sep 27, 2023

Ok, I think I'm in good shape now, @d4straub

  • Test data is merged
  • Added to CI
  • Added to CHANGELOG
  • Made the channel filtering code more explicit, as you suggested (see the sketch below)

Let me know what you think!
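For reference, the idea is roughly the following (a sketch with illustrative channel names, not the literal diff):

```nextflow
// Join all input sample ids against the filtered outputs; with
// remainder: true, samples that produced no filtered file still
// appear in the channel, with null in place of the missing path.
ch_joined = ch_input.join(DADA2_FILTANDTRIM.out.reads, remainder: true)

// Keep only samples that actually passed filtering, warning about the
// rest, so later join steps always see a consistent set of ids.
ch_passed = ch_joined
    .filter { meta, raw, filtered ->
        if (filtered == null) {
            log.warn "Sample ${meta.id} did not pass the filtering threshold and will be skipped"
        }
        return filtered != null
    }
    .map { meta, raw, filtered -> [ meta, filtered ] }
```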

@d4straub (Collaborator) left a review


Great thanks, all looks good.

However, there is one more problem (apologies for not having predicted this earlier; it is a relatively new feature). For the new way of testing with GitHub, at least a "failed.nf.test" in tests/pipeline/ seems to be required (sketched below). That file specifies tests asserting that output files are present and, in conjunction with "failed.nf.test.snap", can even check md5sums. I think the presence/md5sum checks are only needed for central files, here probably:

  • cutadapt/cutadapt_summary.tsv
  • barrnap/summary.tsv
  • dada2/DADA2_table.tsv
  • overall_summary.tsv

But I can also add that later, so I approve anyway (I think you cannot merge the PR with failing tests; let me know if you want me to merge it). It would still be nice, however, if you want to add that last piece!
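A rough sketch of what such a failed.nf.test could look like (the test name, tag, and the choice of which files to snapshot versus merely check for existence are guesses):

```groovy
nextflow_pipeline {

    name "Test pipeline when individual samples fail the filtering threshold"
    script "main.nf"
    tag "test_failed"

    test("Profile: test_failed") {

        when {
            params {
                outdir = "$outputDir"
            }
        }

        then {
            assert workflow.success
            // presence checks for the central output files
            assert new File("$outputDir/cutadapt/cutadapt_summary.tsv").exists()
            assert new File("$outputDir/barrnap/summary.tsv").exists()
            // snapshot() records md5sums in failed.nf.test.snap on the first
            // run and compares against them on subsequent runs
            assert snapshot(path("$outputDir/dada2/DADA2_table.tsv"),
                            path("$outputDir/overall_summary.tsv")).match()
        }
    }
}
```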

@sminot (Contributor, Author) commented on Sep 28, 2023

The test validation piece is not something I'm familiar with, so I would vote for merging as-is and adding those additional files in a subsequent PR.

It does appear that I cannot merge with the tests failing. If you could squash and merge, that would be greatly appreciated!

Thanks also for all your help as I was getting this together, @d4straub. This is my first real contribution to nf-core!

@d4straub merged commit 22969a7 into nf-core:dev on Sep 28, 2023 (15 of 17 checks passed).

@sminot deleted the skip_missing_data branch on September 28, 2023.