Fix warnings, clean namings and overall improvement by FerriolCalvet · Pull Request #414 · bbglab/deepCSA

FerriolCalvet · 2026-01-21T22:06:38Z

AI summary

This pull request introduces several significant changes to the pipeline's configuration and output management, focusing on improved organization of output files, clearer parameter naming, and enhanced maintainability. The most important updates include a comprehensive reorganization of output directories, renaming and consolidation of process parameters, and the addition of a dedicated configuration file for result outputs.

Output Directory and Process Organization:

Output directories for many processes have been standardized and reorganized to use clearer, more descriptive paths (e.g., moving outputs into processing_files, mutations, plots, qc, etc.) to improve discoverability and maintainability. (conf/modules.config [1] [2] [3] [4] [5] [6])
A new configuration file, results_outputs.config, was added to centralize and fine-tune the output locations and patterns for plots, QC, and annotation steps, further enhancing output consistency. (conf/results_outputs.config [1], conf/modules.config [2])

Process and Parameter Refactoring:

Several process names and parameters were renamed for clarity and consistency, such as changing SUBSET* processes to QUERY*, and updating parameters like omega_withingene to create_subgenic_regions. (conf/modules.config [1] [2] [3] [4] [5])
Deprecated or redundant configuration files and parameters were removed, including the deletion of conf/general_files_BBG.config. (conf/general_files_BBG.config, L1-L28)

Schema and Documentation Updates:

Improved error messages for input schema fields to be more descriptive and user-friendly, and updated required fields for sample input. (assets/schema_input.json, L28-R36)
Enhanced script documentation with clearer usage examples and references to input data sources. (assets/useful_scripts/signatures_sigprofilerextractor.py, L38-R40)

Other Notable Changes:

Adjusted publishDir settings for specific processes to disable output where appropriate or to refine file patterns and save logic. (conf/modules.config [1] [2])
Minor logic tweaks, such as removal of unnecessary filter assignments. (conf/modules.config, L253)

These changes collectively improve the clarity, usability, and maintainability of the pipeline's configuration and outputs.
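
For anyone carrying their own config overrides, the SUBSET* to QUERY* renaming means `withName` selectors may need updating. A minimal sketch of that rewrite, assuming every `SUBSET`-prefixed process keeps its suffix under `QUERY` (this PR confirms it only for SUBSETMUTATIONS and SUBSETDEPTHS). The helper name `rename_selectors` is hypothetical and not part of the pipeline:

```python
import re

# Assumption: process names are upper-case and start with SUBSET at a word
# boundary, e.g. withName: 'SUBSETMUTATIONS' or 'DEEPCSA:SUBSETDEPTHS'.
_SUBSET_NAME = re.compile(r"\bSUBSET([A-Z0-9_]+)\b")


def rename_selectors(config_text: str) -> str:
    """Rewrite SUBSET* process names to QUERY* in a config string."""
    return _SUBSET_NAME.sub(r"QUERY\1", config_text)


example = "withName: 'SUBSETMUTATIONS' { publishDir = [enabled: false] }"
print(rename_selectors(example))
```

Names that merely contain `SUBSET` in the middle are left alone, so the rewrite should be reviewed by eye before committing it.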

Copilot

Pull request overview

This pull request implements a comprehensive refactoring focused on parameter standardization, process renaming, and workflow improvements. The changes replace omega_* prefixed parameters with more general subgenic_* naming, rename SUBSET* processes to QUERY* for clarity, and standardize channel factory methods from Channel to lowercase channel. Additionally, the PR introduces optional BAM file requirements, adds new QC plotting capabilities, and reorganizes output publishing through a new configuration file.

Changes:

Standardized parameter names from omega_withingene, omega_autodomains, omega_autoexons, omega_subgenic_bedfile to create_subgenic_regions, autodomains, autoexons, subgenic_bedfile across all configuration files, workflows, modules, and documentation
Renamed processes from SUBSET* to QUERY* (e.g., SUBSETMUTATIONS → QUERYMUTATIONS) for better semantic clarity
Standardized channel factory methods from Channel to lowercase channel throughout the codebase
Made BAM files optional in the input samplesheet when using custom depths, with corresponding validation updates
Added new results_outputs.config for centralized output directory management and PLOTTINGQC subworkflow with omega QC filtering
Updated default parameter values including consensus_panel_min_depth (500→200), mutation_depth_threshold (40→100), and hotspot_expansion (30→0)
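
The renames and default changes listed above can be summarised as a small migration helper for existing params files. This is only a sketch: the mappings are taken from the list above, while the function names `migrate_params` and `pin_previous_defaults` are hypothetical and do not exist in the pipeline.

```python
# Old -> new parameter names, as listed in the review summary.
RENAMED_PARAMS = {
    "omega_withingene": "create_subgenic_regions",
    "omega_autodomains": "autodomains",
    "omega_autoexons": "autoexons",
    "omega_subgenic_bedfile": "subgenic_bedfile",
}

# Defaults before this PR (500->200, 40->100, 30->0 in the summary).
PREVIOUS_DEFAULTS = {
    "consensus_panel_min_depth": 500,
    "mutation_depth_threshold": 40,
    "hotspot_expansion": 30,
}


def migrate_params(params: dict) -> dict:
    """Return a copy of params with the renamed keys updated.

    Raises ValueError if an old and a new name are both set to different values.
    """
    migrated = {}
    for key, value in params.items():
        new_key = RENAMED_PARAMS.get(key, key)
        if new_key in migrated and migrated[new_key] != value:
            raise ValueError(f"conflicting values for {new_key!r}")
        migrated[new_key] = value
    return migrated


def pin_previous_defaults(params: dict) -> dict:
    """Fill in the pre-PR defaults for any of the three changed parameters not set explicitly."""
    pinned = dict(PREVIOUS_DEFAULTS)
    pinned.update(params)
    return pinned


old = {"omega_withingene": True, "omega_autoexons": False, "outdir": "results"}
print(migrate_params(old))
```

Pinning the previous defaults is only useful when results need to stay comparable with runs made before this PR; otherwise the new defaults apply automatically.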

Reviewed changes

Copilot reviewed 44 out of 52 changed files in this pull request and generated 9 comments.

| File | Description |
| --- | --- |
| `workflows/deepcsa.nf` | Updated process names (SUBSET→QUERY, SYNMUTREADSRATE→SYNMUTREADSDENSITY), channel factory standardization, BAM handling logic, and PLOTTINGQC integration |
| `subworkflows/local/*.nf` | Consistent renaming of SUBSET to QUERY processes, channel factory standardization, removal of unnecessary `.first()` calls on process outputs |
| `modules/local/expand_regions/main.nf` | Updated parameter references from `omega_*` to standardized names |
| `nextflow.config` | Renamed parameters, reordered filter criteria, updated default thresholds for depth and mutation filtering |
| `nextflow_schema.json` | Updated schema with new parameter names, descriptions, and reorganized options sections |
| `conf/tools/*.config` | Updated process names and added publish directories for expanded regions outputs |
| `conf/results_outputs.config` | New file centralizing output publication paths for plots and QC processes |
| `bin/*.py` | Updated mutation density metric names (MUTREADSRATE→MUTREADSDENSITY), added omega flagging support, improved BAM column handling |
| `docs/*.md` | Updated documentation to reflect new parameter names and process changes |
| `subworkflows/nf-core/utils_*.nf` | Channel factory standardization and function name updates |
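
The optional-BAM rule mentioned in the overview (BAM files are only required when custom depths are not used) could be checked along these lines. This is a sketch under stated assumptions: the column names `sample`, `vcf`, and `bam` and the flag `use_custom_depths` are placeholders, since the actual schema lives in assets/schema_input.json and is not shown here.

```python
import csv
import io


def validate_rows(samplesheet_text: str, use_custom_depths: bool) -> list:
    """Return a list of error strings; an empty list means the sheet passed."""
    errors = []
    reader = csv.DictReader(io.StringIO(samplesheet_text))
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        if not (row.get("sample") or "").strip():
            errors.append(f"line {line_no}: missing sample name")
        if not (row.get("vcf") or "").strip():
            errors.append(f"line {line_no}: missing VCF path")
        # Assumed rule: a BAM is needed only when depths are not supplied separately.
        if not use_custom_depths and not (row.get("bam") or "").strip():
            errors.append(f"line {line_no}: BAM required unless custom depths are used")
    return errors


sheet = "sample,vcf,bam\ns1,s1.vcf,\n"
print(validate_rows(sheet, use_custom_depths=True))
print(validate_rows(sheet, use_custom_depths=False))
```

Collecting all errors before reporting, instead of stopping at the first, fits the PR's stated goal of more descriptive input error messages.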

FerriolCalvet · 2026-01-22T09:49:34Z

update filter to flag

Copilot

Pull request overview

Copilot reviewed 53 out of 71 changed files in this pull request and generated 5 comments.

m-huertasp

I really like the changes you've added (which are quite a lot). I've been adding some minor comments and I've tested some things; everything looks quite good!

There are more warnings about `.first()` than before. I've been checking, and I still don't understand why they appear or how to solve them all without breaking something.

If you want, we can discuss any of the comments!

FerriolCalvet

Thanks for the comments and for testing things, Marta!!

I am merging it. Let's add your changes, and we will see if there is any corner case that we missed in testing.

FerriolCalvet and others added 17 commits December 25, 2025 13:14

fix linting warnings

f7e3c35

additional fixes

c355494

update configs and panel outputs

94601fc

rename to MUTREADSDENSITY

74c37a5

- fix remaining linting

rename subsetmutations to querymutations

f642b94

rename subsetdepths to querydepths

eec7579

minor renaming of subgenic level info

adfbed2

fix after quick test

8f7f953

update plots outputs and tower reports

67c002c

remove some redundant .first

32c92c3

add all flagged omegas to results

ab1737f

- not tested

restrict omega selection plotting to non-flagged

75739a4

- not tested

allow only vcf input + custom depths

96c3c4a

-minor fixes and additional updates

Merge branch 'dev' into fixformatting

e26b9eb

fix first bugs after testing

b9cbb8f

update parameter filtering order

aebae5b

fix bug in mutdensityqc optional output

771630b

FerriolCalvet requested a review from Copilot January 22, 2026 09:11

Copilot started reviewing on behalf of FerriolCalvet January 22, 2026 09:11

FerriolCalvet added this to the Phase 2 milestone Jan 22, 2026

Copilot AI reviewed Jan 22, 2026

FerriolCalvet added 8 commits January 22, 2026 11:47

add firsts to preprocesiing inputs

e70ef3e

add some more firsts

c37b95d

update several outputs

75f46de

- most should be fixed

fix bug due to typo

38d1cb5

updates in output paths and cleaning

b0f2a86

fix the storing of mutdensities and site selection

b459687

fix bug in channel building

4de00b6

update contamination script

f1ec6c2

FerriolCalvet mentioned this pull request Jan 31, 2026

All mutations being filtered from project #416

Closed

FerriolCalvet linked an issue Jan 31, 2026 that may be closed by this pull request

All mutations being filtered from project #416

Closed

FerriolCalvet added 8 commits February 2, 2026 00:30

turn off additional depth plots unless requested

7e2b54b

fix bug in contamination script when empty

240a0ff

add filtering of vaf of Ns threshold

e2be432

- fixed threshold at 10%

add min_signatures argument to sigprofiler extractor script

cbbf1b0

remove automatic VAF_distorted_expanded_sq

2e75498

- add to FILTER and make it optional

raise error if consensus panel is empty

f28c5e5

allow turn off of exclude_signatures in SPA

8b575bc

SPA = sigprofilerassignment

clean config and update metadata

d2b4f3a
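
The commit "add filtering of vaf of Ns threshold" above notes a fixed 10% threshold. How the pipeline computes this is not shown in the PR. The sketch below assumes the quantity is the fraction of reads with an `N` call at a site, and that a site is flagged when that fraction is strictly above the threshold; both the function names and the strict comparison are assumptions.

```python
N_VAF_THRESHOLD = 0.10  # the commit note says the threshold is fixed at 10%


def n_fraction(n_reads: int, depth: int) -> float:
    """Fraction of reads at a site whose base call is N (0.0 when depth is 0)."""
    if depth <= 0:
        return 0.0
    return n_reads / depth


def exceeds_n_threshold(n_reads: int, depth: int,
                        threshold: float = N_VAF_THRESHOLD) -> bool:
    """True when the N fraction is strictly above the threshold (assumed rule)."""
    return n_fraction(n_reads, depth) > threshold


for n_reads, depth in [(5, 100), (10, 100), (11, 100)]:
    print(n_reads, depth, exceeds_n_threshold(n_reads, depth))
```

Whether a site sitting exactly at 10% should be flagged depends on the real implementation, so the comparison operator is the first thing to check against the pipeline code.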

FerriolCalvet requested a review from Copilot February 6, 2026 13:37

Copilot started reviewing on behalf of FerriolCalvet February 6, 2026 13:38

Copilot AI reviewed Feb 6, 2026

Comment thread modules/local/plot/qc/mutation_densities/main.nf

Comment thread conf/results_outputs.config

Comment thread nextflow.config

Comment thread modules/local/compute_mutability/main.nf

Comment thread modules/local/signatures/sigprofiler/assignment/main.nf

FerriolCalvet requested review from m-huertasp February 6, 2026 14:10

add relative gene coverage

cdfbbf7

- Add computation of relative coverage of the gene Fixes #415

This was linked to issues Feb 7, 2026

Add computation of relative coverage of the gene #415

Closed

revise contamination outputs #344

Closed

FerriolCalvet added 6 commits February 9, 2026 10:29

add trinucleotide proportions comparisons

51799c5

- per consensus panel - per sample

add trinucleotide proportion plots to output

76108b0

add gene coverage plots to output

871b7f7

update depths summary output location

782794f

final output format for depths

5fbed7d

define deepcsa_core container label

82f4994
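
The trinucleotide proportion comparisons added in the commits above ("per consensus panel", "per sample") are not detailed in this PR. As a rough illustration of the general idea only, the sketch below counts overlapping trinucleotides in a set of sequences, normalises them to proportions, and reports per-context differences between two sets. It does not collapse contexts to a pyrimidine-centred representation, which a real implementation may do, and all names are hypothetical.

```python
from collections import Counter

VALID_BASES = set("ACGT")


def trinucleotide_proportions(sequences) -> dict:
    """Proportion of each trinucleotide over all overlapping windows.

    Windows containing anything other than A, C, G, or T are skipped.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - 2):
            tri = seq[i:i + 3]
            if set(tri) <= VALID_BASES:
                counts[tri] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tri: n / total for tri, n in counts.items()}


def proportion_differences(reference: dict, other: dict) -> dict:
    """Per-context difference (other minus reference); missing contexts count as 0."""
    contexts = sorted(set(reference) | set(other))
    return {c: other.get(c, 0.0) - reference.get(c, 0.0) for c in contexts}


panel = trinucleotide_proportions(["ACGT"])
sample = trinucleotide_proportions(["ACG"])
print(proportion_differences(panel, sample))
```

Here `panel` stands in for the consensus-panel regions and `sample` for the regions covered in one sample; large differences would suggest that a sample's covered regions are not representative of the panel.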

m-huertasp reviewed Feb 10, 2026

FerriolCalvet commented Feb 11, 2026

apply review comments

070015f

FerriolCalvet merged commit 281d63b into dev Feb 11, 2026

FerriolCalvet mentioned this pull request Feb 11, 2026

Update structure of outputs bbglab/deepUMIcaller#178

Closed

FerriolCalvet deleted the fixformatting branch February 11, 2026 13:50

3 participants
