
Reorganise pipeline tests into flat structure #1280

Merged · 3 commits · Apr 10, 2024
Conversation

@adamrtalbot (Contributor) commented Apr 10, 2024

Changes:

  • Move all test files from tests/tests/ to test/*.nf.test
  • Makes the structure flatter and discovery easier, at the cost of looking untidier.
  • To be discussed in the PR!

The key question is: how do we want to organise the pipeline-level tests for pipelines with lots of tests?

Answers below please...

PR checklist

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
  • If you've added a new tool, have you followed the pipeline conventions in the contribution docs?
  • If necessary, also make a PR on the nf-core/rnaseq branch on the nf-core/test-datasets repository.
  • Make sure your code lints (nf-core lint).
  • Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
  • Check for unexpected warnings in debug mode (nextflow run . -profile debug,test,docker --outdir <OUTDIR>).
  • Usage Documentation in docs/usage.md is updated.
  • Output Documentation in docs/output.md is updated.
  • CHANGELOG.md is updated.
  • README.md is updated (including new tool citations and authors/contributors).

@adamrtalbot adamrtalbot added this to the nf-test milestone Apr 10, 2024
@maxulysse (Member) left a comment


Loving it

github-actions bot commented Apr 10, 2024

nf-core lint overall result: Passed ✅ ⚠️

Posted for pipeline commit c259f30

  • ✅ 167 tests passed
  • ❔ 9 tests were ignored
  • ❗ 7 tests had warnings

❗ Test warnings:

  • files_exist - File not found: assets/multiqc_config.yml
  • files_exist - File not found: .github/workflows/awstest.yml
  • files_exist - File not found: .github/workflows/awsfulltest.yml
  • pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your preferred methods description, e.g. add publication citation for this pipeline
  • pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
  • pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
  • pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!

❔ Tests ignored:

  • files_exist - File is ignored: conf/modules.config
  • nextflow_config - Config default ignored: params.ribo_database_manifest
  • files_unchanged - File ignored due to lint config: .github/PULL_REQUEST_TEMPLATE.md
  • files_unchanged - File ignored due to lint config: assets/email_template.html
  • files_unchanged - File ignored due to lint config: assets/email_template.txt
  • files_unchanged - File ignored due to lint config: .gitignore or .prettierignore or pyproject.toml
  • actions_ci - actions_ci
  • actions_awstest - 'awstest.yml' workflow not found: /home/runner/work/rnaseq/rnaseq/.github/workflows/awstest.yml
  • multiqc_config - 'assets/multiqc_config.yml' not found

✅ Tests passed:

Run details

  • nf-core/tools version 2.13.1
  • Run at 2024-04-10 13:56:56

@pinin4fjords (Member) left a comment


Looks fine, but missing most of the snaps, no?

@adamrtalbot (Contributor, Author) replied:

> Looks fine, but missing most of the snaps, no?

There are no snaps.

You're welcome to write snaps for the pipeline tests, I gave up after about 10 hours.

@MatthiasZepper (Member) commented:

> how do we want to organise the pipeline level tests for pipelines with lots of tests?

Without knowing what has been discussed before... how many different tests do you expect a pipeline to have?

Since all modules and subworkflows should already have been tested, most unit tests are done (except for local ones), and the total number of tests should be no more than a dozen or so? And do you intend to stick with having one file per test?

BTW: Don't you want to get rid of the hardcoded https://raw.githubusercontent.com/nf-core/test-datasets/7f1614baeb0ddf66e60be78c3d9fa55440465ac8/samplesheet/v3.10/samplesheet_test.csv in the process?

@adamrtalbot (Contributor, Author) replied:

> Without knowing what has been discussed before...how many different tests do you expect a pipeline will have?

For most pipelines, 3-10. For rnaseq, 10-20; for Sarek, 200+.

> Since all modules and subworkflows should already have been tested, most unit tests are done (except for local ones) and the total number of tests should not be more than a dozen or so? And do you intend to stick with having one file per test?

Yep, I've removed a few tests on the basis that the subworkflow covers the use case. The main reason for a pipeline-level test is a big parameter like --skip_qc that is used throughout the pipeline.

> BTW: Don't you want to get rid of the hardcoded https://raw.githubusercontent.com/nf-core/test-datasets/7f1614baeb0ddf66e60be78c3d9fa55440465ac8/samplesheet/v3.10/samplesheet_test.csv in the process?

I can't actually remember why we didn't do this in the first place 🤔

@adamrtalbot (Contributor, Author) commented:

OK, so params.pipelines_testdata_base_path doesn't work in a params block:

params {
    outdir = "$outputDir"
    input  = "${params.pipelines_testdata_base_path}/csv/samplesheet_test.csv"
}

But it turns out we can just remove it, and the input is retrieved from -profile test instead.
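A minimal sketch of what the resulting pipeline-level test might look like (the file name, test name, and tag are illustrative, not necessarily what this PR uses); with no input in the params block, the default from -profile test applies:

```groovy
// tests/default.nf.test -- illustrative file name
nextflow_pipeline {

    name "Test pipeline with default settings"
    script "../main.nf"

    test("Runs with -profile test defaults") {

        when {
            params {
                // 'input' is deliberately omitted: -profile test supplies it,
                // so no hardcoded test-datasets URL is needed here
                outdir = "$outputDir"
            }
        }

        then {
            assert workflow.success
        }
    }
}
```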

@MatthiasZepper (Member) replied:

> > Without knowing what has been discussed before...how many different tests do you expect a pipeline will have?
>
> For most pipelines 3-10. For rnaseq 10-20, for Sarek 200+

That number for Sarek sounds excessive, but it would clearly argue against your proposed flat structure, then? At least if we are talking about that many files in a single folder... one might also not want to run each of them every time, so some sort of easy subset mechanism would be desirable.

I don't know about nf-test, but with pytest -k / -m one can, for example, easily select particular tests. If nf-test has nothing of that kind, subfolders could provide a convenient way to run selected tests only?

> Yep, I've removed a few tests on the basis that the subworkflow covers the use case. The main reason for a pipeline level test is a big parameter like --skip_qc which is used throughout the pipeline.

Appreciated, and I agree that integration tests are paramount! Yet a single workflow.success is not a very specific test condition. Is it planned to extend the tests later with more precise conditions, or is this what you wasted the 10 hours on specifically?

@edmundmiller (Contributor) commented:

I like it! Not sure if there will be too many snaps one day, but I think we can cross that bridge when we get there.

> You're welcome to write snaps for the pipeline tests, I gave up after about 10 hours.

Yeah, this is how long hic took me as well.

I'm wondering if we make a compromise and just snapshot the featurecounts output or something? It's far enough downstream that any changes will get caught.

I also find myself copying the same list of things to snapshot and assert across files. I wonder if we could drop them in tests/lib/pipeline_outputs.groovy, include them with one line, and update all expected files in one go? This was a huge pain in methylseq (@sateeshperi).
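One possible shape for that shared list, assuming nf-test is configured to pick up helper classes from a lib directory; the class name and output paths below are hypothetical, for illustration only:

```groovy
// tests/lib/PipelineOutputs.groovy -- hypothetical shared helper
class PipelineOutputs {
    // One central list of files to snapshot; edit here to update every test at once
    static List<String> stable(String outdir) {
        return [
            "${outdir}/star_salmon/salmon.merged.gene_counts.tsv",  // illustrative paths
            "${outdir}/multiqc/multiqc_report.html",
        ]
    }
}

// Sketch of use in a test's `then` block:
//   assert snapshot(PipelineOutputs.stable(params.outdir)).match()
```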

@adamrtalbot (Contributor, Author) replied:

> That number for Sarek sounds excessive, but would clearly argue against your proposed flat structure, then? At least if we are talking about so many files in a single folder....one might also not want to run each of them every time, so some sort of easy subset mechanism would be desirable.
>
> I don't know about nf-test, but with pytest -k / -m once can for example easily select particular tests. If nf-test has nothing of that kind, subfolders could provide a convenient way to run selected tests only?

You can use tags, names, or paths to select nf-test files. I originally grouped them into directories (tests/tests) but it looked a bit ugly. @maxulysse suggested this layout, and I opened this PR as a starting point for a discussion. How do people want to organise their pipeline tests? Do lots of subfolders make sense, or would a flatter structure be easier to understand?
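As a sketch of the tag-based selection mentioned above (file name, tag, and test name are made up for illustration), a flat test file can carry a tag and be run either by tag or by path:

```groovy
// tests/skip_qc.nf.test -- illustrative; the `tag` lets nf-test select subsets:
//   nf-test test --tag qc                 (run only tests tagged 'qc')
//   nf-test test tests/skip_qc.nf.test    (select by path)
nextflow_pipeline {

    name "Test pipeline with --skip_qc"
    script "../main.nf"
    tag "qc"   // tag name is made up for illustration

    test("Runs with QC skipped") {
        when {
            params {
                outdir  = "$outputDir"
                skip_qc = true
            }
        }
        then {
            assert workflow.success
        }
    }
}
```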

Regarding snapshots, I'm going to work on that some other time. But yes, some form of specific snapshot will be needed. For now, we just want to recreate the existing pipeline tests.

@adamrtalbot (Contributor, Author) commented:

OK, my decision so far: let's go with this flat structure. Individual pipeline developers can add more granularity with tags/directories/naming if they like; we won't enforce a very strict structure here.

@adamrtalbot adamrtalbot merged commit 5833810 into nf-core:dev Apr 10, 2024
184 checks passed
5 participants