Fix warnings, clean namings and overall improvement#414
Conversation
- fix remaining linting
- not tested
- not tested
-minor fixes and additional updates
There was a problem hiding this comment.
Pull request overview
This pull request implements a comprehensive refactoring focused on parameter standardization, process renaming, and workflow improvements. The changes replace omega_* prefixed parameters with more general subgenic_* naming, rename SUBSET* processes to QUERY* for clarity, and standardize channel factory methods from Channel to lowercase channel. Additionally, the PR introduces optional BAM file requirements, adds new QC plotting capabilities, and reorganizes output publishing through a new configuration file.
Changes:
- Standardized parameter names from
omega_withingene,omega_autodomains,omega_autoexons,omega_subgenic_bedfiletocreate_subgenic_regions,autodomains,autoexons,subgenic_bedfileacross all configuration files, workflows, modules, and documentation - Renamed processes from
SUBSET*toQUERY*(e.g.,SUBSETMUTATIONS→QUERYMUTATIONS) for better semantic clarity - Standardized channel factory methods from
Channelto lowercasechannelthroughout the codebase - Made BAM files optional in the input samplesheet when using custom depths, with corresponding validation updates
- Added new
results_outputs.configfor centralized output directory management and PLOTTINGQC subworkflow with omega QC filtering - Updated default parameter values including
consensus_panel_min_depth(500→200),mutation_depth_threshold(40→100), andhotspot_expansion(30→0)
Reviewed changes
Copilot reviewed 44 out of 52 changed files in this pull request and generated 9 comments.
Show a summary per file
| File | Description |
|---|---|
| workflows/deepcsa.nf | Updated process names (SUBSET→QUERY, SYNMUTREADSRATE→SYNMUTREADSDENSITY), channel factory standardization, BAM handling logic, and PLOTTINGQC integration |
| subworkflows/local/*.nf | Consistent renaming of SUBSET to QUERY processes, channel factory standardization, removal of unnecessary .first() calls on process outputs |
| modules/local/expand_regions/main.nf | Updated parameter references from omega_* to standardized names |
| nextflow.config | Renamed parameters, reordered filter criteria, updated default thresholds for depth and mutation filtering |
| nextflow_schema.json | Updated schema with new parameter names, descriptions, and reorganized options sections |
| conf/tools/*.config | Updated process names and added publish directories for expanded regions outputs |
| conf/results_outputs.config | New file centralizing output publication paths for plots and QC processes |
| bin/*.py | Updated mutation density metric names (MUTREADSRATE→MUTREADSDENSITY), added omega flagging support, improved BAM column handling |
| docs/*.md | Updated documentation to reflect new parameter names and process changes |
| subworkflows/nf-core/utils_*.nf | Channel factory standardization and function name updates |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
|
update filter to flag |
- most should be fixed
- fixed threshold at 10%
- add to FILTER and make it optional
SPA = sigprofilerassignment
There was a problem hiding this comment.
Pull request overview
Copilot reviewed 53 out of 71 changed files in this pull request and generated 5 comments.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
- Add computation of relative coverage of the gene Fixes #415
- per consensus panel - per sample
m-huertasp
left a comment
There was a problem hiding this comment.
I really like the changes you've added (which are quite a lot). I've been adding some minor coments and I've tested some things, everything looks quite good!
There are more warnings about the .first() thing than before, I've been checking and I still don't understand why or how to solve them all without crashing something.
If you want we can discuss about an coment!
FerriolCalvet
left a comment
There was a problem hiding this comment.
thanks for the comments and testing things Marta!!
I am merging it and let's add you changes and we will see if there is any corner case that we missed to test
AI summary
This pull request introduces several significant changes to the pipeline's configuration and output management, focusing on improved organization of output files, clearer parameter naming, and enhanced maintainability. The most important updates include a comprehensive reorganization of output directories, renaming and consolidation of process parameters, and the addition of a dedicated configuration file for result outputs.
Output Directory and Process Organization:
processing_files,mutations,plots,qc, etc.) to improve discoverability and maintainability. (conf/modules.config[1] [2] [3] [4] [5] [6]results_outputs.config, was added to centralize and fine-tune the output locations and patterns for plots, QC, and annotation steps, further enhancing output consistency. (conf/results_outputs.config[1]conf/modules.config[2]Process and Parameter Refactoring:
SUBSET*processes toQUERY*, and updating parameters likeomega_withingenetocreate_subgenic_regions. (conf/modules.config[1] [2] [3] [4] [5]conf/general_files_BBG.config. (conf/general_files_BBG.configconf/general_files_BBG.configL1-L28)Schema and Documentation Updates:
assets/schema_input.jsonassets/schema_input.jsonL28-R36)assets/useful_scripts/signatures_sigprofilerextractor.pyassets/useful_scripts/signatures_sigprofilerextractor.pyL38-R40)Other Notable Changes:
conf/modules.config[1] [2]conf/modules.configconf/modules.configL253)These changes collectively improve the clarity, usability, and maintainability of the pipeline's configuration and outputs.