Refactor: consolidate BIDS, add orchestration layer, extract longitudinal by nx10 · Pull Request #267 · childmindresearch/rbc

nx10 · 2026-04-03T20:46:02Z

Major architecture refactor that introduces clean separation of concerns across four layers:

bids/ - BIDS naming contracts (discover, resolve, export per workflow)
orchestration/ - Pipeline loops (filter, iterate, discover -> process -> export)
workflows/ - Processing step chains (computation only)
cli/ - Arg parsing only (delegates to orchestration)

What changed

New rbc.bids/ package (moved from core/bids/ + core/bids2table.py + cli/query.py):

builder.py - Bids class
_schema.py - auto-generated entity validation
query.py - bids2table wrappers
session.py - session loading, iteration, grouping constants
anatomical.py - discover + export
functional.py - discover + resolve + export, with FunctionalInputs TypedDict
metrics.py - resolve + export, with MetricsInputs TypedDict
qc.py - resolve + export, with QCInputs TypedDict
longitudinal.py - resolve + export for longitudinal workflows

New rbc.orchestration/ package:

__init__.py - Filters dataclass (participant/session/task filters)
anatomical.py - run() + process_session()
functional.py - run() + process_session() (returns outputs for downstream use)
metrics.py - run() + process_run()
qc.py - run()
all.py - run() composing per-session stages with in-memory output passing
longitudinal.py - run() + process_anat() + process_func()

Simplified CLI modules: Each CLI is now ~40-60 lines (Args dataclass + main() that calls orchestration.run() + register_command()). Zero BIDS or workflow logic.
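The split between orchestration and CLI described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the fields of Filters, the `accepts()` method, the `(sub, ses)` tuples, and the discover/process/export callables are all assumptions made for the example.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass(frozen=True)
class Filters:
    """Hypothetical participant/session/task filters (field names assumed)."""

    participants: tuple[str, ...] = ()
    sessions: tuple[str, ...] = ()
    tasks: tuple[str, ...] = ()  # unused in this sketch

    def accepts(self, sub: str, ses: str) -> bool:
        # An empty filter means "accept everything".
        ok_sub = not self.participants or sub in self.participants
        ok_ses = not self.sessions or ses in self.sessions
        return ok_sub and ok_ses


def run(
    sessions: Iterable[tuple[str, str]],
    filters: Filters,
    discover: Callable[[str, str], list[str]],
    process: Callable[[str], str],
    export: Callable[[str], None],
) -> int:
    """Orchestration loop: filter, iterate, discover -> process -> export."""
    n_processed = 0
    for sub, ses in sessions:
        if not filters.accepts(sub, ses):
            continue
        for item in discover(sub, ses):
            export(process(item))
            n_processed += 1
    return n_processed


def main(participants: list[str]) -> int:
    """Thin CLI stand-in: build Filters, delegate to run(), return exit code."""
    exported: list[str] = []
    run(
        sessions=[("01", "a"), ("02", "a")],
        filters=Filters(participants=tuple(participants)),
        discover=lambda sub, ses: [f"sub-{sub}_ses-{ses}_T1w"],
        process=lambda name: name + "_processed",
        export=exported.append,
    )
    print(exported)
    return 0
```

Passing the discover, process, and export steps in as callables is only a device to keep the sketch self-contained; in the layout described here they would come from bids/ and workflows/ respectively.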

Deleted

src/rbc/core/bids/ (entire package)
src/rbc/core/bids2table.py
src/rbc/cli/query.py

Bugs fixed

Atlas names with underscores (schaefer_200) caused BIDS validation errors
Regressor names with hyphens (36-parameter) caused the same issue
QC resolve used .mat extension but functional export writes .txt
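BIDS entity labels must be alphanumeric, which is why names such as schaefer_200 and 36-parameter fail validation. The commit message in this PR names a bids_safe_label() helper for the fix; the body below is only a guess at one possible behaviour (dropping every non-alphanumeric character), not the implementation in rbc.

```python
import re

_NON_ALNUM = re.compile(r"[^A-Za-z0-9]+")


def bids_safe_label(label: str) -> str:
    """Strip characters that BIDS does not allow in entity labels.

    Assumed behaviour: drop everything that is not a letter or digit.
    The real helper may transform names differently (e.g. camel-casing).
    """
    cleaned = _NON_ALNUM.sub("", label)
    if not cleaned:
        raise ValueError(f"label {label!r} has no alphanumeric characters")
    return cleaned


for raw in ("schaefer_200", "36-parameter"):
    print(f"{raw!r} -> {bids_safe_label(raw)!r}")
```

Sanitizing inside the export functions, rather than at each call site, keeps callers free to use the human-readable atlas and regressor names.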

Tests

Moved: test_bids.py, test_bids2table.py, test_query.py to tests/unit/bids/
New: tests/unit/bids/test_exports.py (11 tests with real Bids instances)
Updated: all CLI test mock patches target orchestration modules

Closes #259, closes #266.

Move all BIDS-related code from scattered locations into a cohesive rbc.bids/ package and extract per-workflow export/resolve functions from CLI modules.

Structural changes:
- core/bids/ -> bids/builder.py (Bids class), bids/_schema.py (auto-generated)
- core/bids2table.py -> bids/query.py (load_table, find_file, etc.)
- New bids/anatomical.py, functional.py, metrics.py, qc.py with export_*() and resolve_*() functions extracted from CLI modules
- Tests moved: test_bids.py, test_bids2table.py -> tests/unit/bids/
- New tests/unit/bids/test_exports.py with real Bids instance tests

Bug fixes caught during refactor:
- Atlas names with underscores (schaefer_200) caused BIDS validation errors; now sanitized via bids_safe_label() inside export functions
- Regressor names with hyphens (36-parameter) had the same issue
- QC resolve used .mat extension but functional export writes .txt

The CLI modules are now thin orchestration layers that call shared export/resolve functions instead of duplicating BIDS naming logic.

Unifies the pattern across all resolve functions: resolve_qc already returned QCInputs (TypedDict), now resolve_functional returns FunctionalInputs and resolve_metrics returns MetricsInputs. This gives callers proper type safety when accessing resolved paths. Part of #259.
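As a rough illustration of the pattern, a resolve function returning a TypedDict might look like the sketch below. The keys, the signature of resolve_functional(), and the file naming are assumptions for the example; only the names FunctionalInputs and resolve_functional, and the from-bold/to-T1w/mode-image .txt transform, come from this PR.

```python
from pathlib import Path
from typing import TypedDict


class FunctionalInputs(TypedDict):
    """Hypothetical keys; the real TypedDict in bids/functional.py may differ."""

    bold: Path
    sbref: Path
    bold_to_t1w_xfm: Path


def resolve_functional(root: Path, sub: str, ses: str) -> FunctionalInputs:
    """Build the expected input paths for one session (naming is illustrative)."""
    prefix = f"sub-{sub}_ses-{ses}"
    func_dir = root / f"sub-{sub}" / f"ses-{ses}" / "func"
    return FunctionalInputs(
        bold=func_dir / f"{prefix}_bold.nii.gz",
        sbref=func_dir / f"{prefix}_sbref.nii.gz",
        bold_to_t1w_xfm=func_dir / f"{prefix}_from-bold_to-T1w_mode-image_xfm.txt",
    )


inputs = resolve_functional(Path("/data/derivatives"), "01", "a")
# A type checker knows inputs["bold"] is a Path and rejects unknown keys
# such as inputs["bolt"]; at runtime this is still a plain dict.
print(inputs["bold"].name)
```

Because a TypedDict is an ordinary dict at runtime, callers that already index by key need no changes; the benefit is purely in static checking.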

Moves SessionTables, load_session, iter_session_files, and the BIDS grouping constants (SUB_SES_QUERY, ANAT_GROUP_ENTITIES, FUNC_GROUP_ENTITIES) from cli/ into bids/session.py so that cli/ exclusively handles CLI concerns. Constants are renamed to drop the leading underscore since they are now public API in the bids package. Closes #259.

Adds discover_anatomical(), discover_functional(), and discover_derivative_runs() to the bids package, removing BIDS-specific iteration logic (entity extraction, row-to-path conversion, DataFrame filtering/grouping) from CLI modules. CLI modules now call discovery functions and receive structured NamedTuples (AnatomicalRun, FunctionalRun, DerivativeRun) instead of manually iterating DataFrames.
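The idea of a discovery function that hides row iteration behind a structured return type can be sketched as below. The fields of FunctionalRun, the row keys, and the use of plain dicts in place of a DataFrame are assumptions made to keep the example dependency-free; the real functions operate on the tables loaded by the bids package.

```python
from __future__ import annotations

from pathlib import Path
from typing import Iterator, NamedTuple, Optional


class FunctionalRun(NamedTuple):
    """Hypothetical fields; the real NamedTuple may carry different entities."""

    sub: str
    ses: str
    task: str
    run: Optional[str]
    bold: Path


def discover_functional(rows: list[dict[str, str]]) -> Iterator[FunctionalRun]:
    """Yield one FunctionalRun per BOLD row, ordered by (sub, ses, task, run)."""
    bold_rows = [r for r in rows if r.get("suffix") == "bold"]
    bold_rows.sort(key=lambda r: (r["sub"], r["ses"], r["task"], r.get("run") or ""))
    for r in bold_rows:
        yield FunctionalRun(r["sub"], r["ses"], r["task"], r.get("run"), Path(r["path"]))


rows = [
    {"sub": "01", "ses": "a", "task": "rest", "suffix": "sbref", "path": "x_sbref.nii.gz"},
    {"sub": "01", "ses": "a", "task": "rest", "suffix": "bold", "path": "x_bold.nii.gz"},
]
for found in discover_functional(rows):
    # Callers read named attributes instead of indexing DataFrame columns.
    print(found.task, found.bold)
```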

github-actions · 2026-04-03T20:46:45Z

Coverage Report

File	Stmts	Miss	Cover	Missing
rbc
__init__.py	1	0	100%
context.py	25	8	68%	70, 75–77, 79–80, 93–94
metadata.py	56	0	100%
rbc/bids
__init__.py	9	0	100%
_schema.py	585	4	99%	776, 782, 790, 1101
anatomical.py	22	0	100%
builder.py	72	7	90%	233–235, 362, 364–365, 386
functional.py	42	4	90%	46–49
longitudinal.py	25	0	100%
metrics.py	23	0	100%
qc.py	17	0	100%
query.py	68	42	38%	100–104, 118–122, 124–125, 127–132, 134, 136, 150, 156–162, 195, 204, 206–213, 222, 254, 263–264
session.py	47	0	100%
rbc/cli
__init__.py	1	0	100%
all.py	35	0	100%
anatomical.py	17	0	100%
base.py	37	1	97%	51
functional.py	25	0	100%
longitudinal.py	23	0	100%
main.py	43	0	100%
metrics.py	32	0	100%
qc.py	25	0	100%
rbc/core
__init__.py	3	0	100%
common.py	21	8	61%	42–44, 58, 61–64
fileops.py	27	4	85%	69–72
fsl2itk.py	42	0	100%
nifti.py	192	5	97%	236–237, 244–245, 524
niwrap.py	56	1	98%	58
rbc/core/anatomical
__init__.py	4	0	100%
registration.py	14	4	71%	45, 137, 152, 167
segmentation.py	24	8	66%	61, 71–73, 89, 111, 122, 138
rbc/core/functional
__init__.py	13	0	100%
coregistration.py	7	2	71%	44, 55
despiking.py	7	3	57%	32, 36–37
distortion.py	130	40	69%	269–271, 321, 324, 332–335, 341–346, 349, 352–353, 356, 365–369, 375–376, 387–388, 391, 397, 443–444, 447, 455, 461, 470–471, 474, 484, 491
erosion.py	32	1	96%	50
initialization.py	9	4	55%	35, 42–43, 63
masking.py	34	25	26%	53, 55–56, 58–59, 62–65, 69, 91, 134, 183, 197, 208, 223, 233, 249, 258, 271, 285, 296, 306, 319, 328
motion.py	57	37	35%	62, 64–67, 69, 71, 73, 76–77, 83–84, 86–87, 95–97, 99, 102, 105, 107, 124–125, 135–138, 159, 169, 171–172, 175, 177–179, 181, 183
nuisance.py	81	60	25%	78, 80–85, 87, 89–90, 93, 96–98, 100–102, 104–110, 112–116, 118, 163, 165, 167, 170–171, 173–175, 178, 181–182, 185–187, 190–193, 197, 203–205, 207, 235, 243, 269, 278, 307, 316, 322
regressors.py	89	6	93%	163, 193, 325–328
resampling.py	54	43	20%	37–42, 74–76, 78, 80–81, 85–87, 90, 93, 105, 107–108, 112, 114, 150–152, 154–155, 159, 161–162, 167–169, 173–174, 177, 180, 187, 199, 201–202, 207, 209
timing.py	16	11	31%	46–47, 49–53, 58, 60, 66–67
rbc/core/longitudinal
__init__.py	1	0	100%
transform.py	46	7	84%	106–107, 165–168, 170
rbc/core/metrics
__init__.py	3	0	100%
alff.py	90	1	98%	265
reho.py	66	0	100%
smoothing.py	7	3	57%	36, 42–43
standardization.py	27	11	59%	64, 66–68, 70, 72–75, 77–78
timeseries.py	57	1	98%	120
rbc/core/qc
__init__.py	6	0	100%
dvars.py	26	0	100%
motion.py	31	0	100%
registration.py	41	0	100%
xcp.py	41	0	100%
rbc/orchestration
__init__.py	19	2	89%	64, 67
all.py	47	0	100%
anatomical.py	41	2	95%	47–48
functional.py	45	0	100%
longitudinal.py	60	1	98%	146
metrics.py	54	5	90%	36, 39, 68, 76–77
qc.py	41	0	100%
rbc/workflows
__init__.py	10	0	100%
anatomical.py	45	19	57%	76–85, 87, 154–157, 159–162
functional.py	98	56	42%	113–114, 122, 127–128, 136, 201–202, 205–206, 209–210, 213, 216–219, 229–231, 241–242, 245, 248, 251–252, 260–261, 268–269, 276–277, 284–285, 288–290, 293–296, 308–309, 321, 325–328, 331–332, 341–342, 350, 357, 426, 432
metrics.py	40	17	57%	83–84, 89–90, 93–96, 99–102, 106–109, 111
qc.py	51	31	39%	85, 87–88, 91, 94, 97–101, 104, 113–115, 119–122, 124–127, 129–130, 132–134, 137, 154, 161, 163
rbc_resources
__init__.py	30	0	100%
TOTAL	3165	484	84%

Tests	Skipped	Failures	Errors	Time
763	0 💤	0 ❌	0 🔥	11.812s ⏱️

Creates src/rbc/orchestration/ with per-workflow run() functions that own the full pipeline loop: filtering, sub/ses iteration, discovery, processing, and export. CLI modules are now thin arg-parsing wrappers that delegate to orchestration.run().

New modules:
- orchestration/__init__.py: Filters dataclass
- orchestration/anatomical.py: run() + process_session()
- orchestration/functional.py: run() + process_session()
- orchestration/metrics.py: run() + process_run()
- orchestration/qc.py: run()
- orchestration/all.py: run() composing per-session stages
- orchestration/longitudinal.py: run() + process_anat() + process_func()
- bids/longitudinal.py: resolve + export for longitudinal workflow

CLI modules now contain only: Args dataclass, main() (setup runner + call orchestration.run()), and register_command(). Closes #266.

Orchestration run() functions now handle runner setup (init_runner) and workflow start/complete logging. CLI modules are reduced to pure arg parsing: construct Filters + RunnerConfig, call run(), return 0. Introduces RunnerConfig dataclass and init_runner() in orchestration. Moves _DEFAULT_ENV_VARS from cli/__init__.py to orchestration/.
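A minimal sketch of this division of labour is shown below. RunnerConfig, init_runner(), and run() are names from this PR, but every field, signature, and return value here is an assumption; in particular, the stand-in init_runner() returns a plain dict rather than configuring a real runner.

```python
from __future__ import annotations

import logging
from dataclasses import dataclass, field
from pathlib import Path

log = logging.getLogger("rbc.sketch")


@dataclass(frozen=True)
class RunnerConfig:
    """Hypothetical runner settings; field names are assumptions."""

    runner: str = "local"
    verbose: bool = False
    env: dict[str, str] = field(default_factory=dict)


def init_runner(config: RunnerConfig) -> dict[str, object]:
    """Stand-in for runner setup: returns a description instead of a runner."""
    logging.basicConfig(level=logging.DEBUG if config.verbose else logging.INFO)
    return {"runner": config.runner, "env": dict(config.env)}


def run(output_dir: Path, config: RunnerConfig) -> None:
    """Orchestration entry point: set up the runner and log start/complete."""
    runner = init_runner(config)
    log.info("workflow started (runner=%s)", runner["runner"])
    # ... filter, discover, process, and export would happen here ...
    log.info("workflow complete, outputs in %s", output_dir)


def main(args: list[str]) -> int:
    """Pure arg handling: build the config, call run(), return 0."""
    config = RunnerConfig(verbose="--verbose" in args)
    run(Path("out"), config)
    return 0
```

Keeping runner setup and logging inside run() means a programmatic caller gets the same behaviour as the CLI without going through argument parsing.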

kaitj

A few questions, one small documentation change, and a few notes to revisit; otherwise this largely looks good.

kaitj · 2026-04-06T14:20:26Z

+    aex.save(
+        _require_file(outputs.brain_mask, "brain_mask"),
+        suffix=Suffix.MASK,
+        desc="T1w",
+    )
+    aex.save(
+        _require_file(outputs.csf_mask, "csf_mask"),
+        suffix=Suffix.MASK,
+        desc="csf",
+    )
+    aex.save(
+        _require_file(outputs.gm_mask, "gm_mask"),
+        suffix=Suffix.MASK,
+        desc="gm",
+    )
+    aex.save(
+        _require_file(outputs.wm_mask, "wm_mask"),
+        suffix=Suffix.MASK,
+        desc="wm",
+    )


We should revisit whether these files are still needed. I think they were originally included because they were needed to compute regressors in template space, which I don't think is the case anymore.

kaitj · 2026-04-06T14:22:02Z

+    func_q: Bids,
+    tpl_q: Bids,
+    func_df: pl.DataFrame,
+    tpl_df: pl.DataFrame,


The more I look at this, the more I wonder whether we should rename this variable to distinguish "standard" templates (e.g. MNI, OASIS) from the "longitudinal template". Or would that be more confusing?

kaitj · 2026-04-06T14:23:26Z

+            extension=".txt",
+            extra={"from": "bold", "to": "T1w", "mode": "image"},
+        ),
+        "sbref": func_q.expect(func_df, suffix=Suffix.SBREF, without=["space"]),


Oh cool, first time I noticed the without argument 🚀

- Resolve merge conflicts (keep orchestration layer, discard old inline logic)
- Fix docstring paths in generate_bids_tools.py (#1)
- Remove redundant extension=Extension.NII_GZ in longitudinal export (#5)
- Clarify verbose docstring in RunnerConfig (#6)

nx10 added 5 commits April 3, 2026 15:13

Regenerate test_bids.py for updated import path

f98692d

nx10 changed the title ~~Consolidate BIDS code and extract discovery/export/resolve from CLIs~~ Refactor: consolidate BIDS, add orchestration layer, extract longitudinal Apr 3, 2026

This was referenced Apr 3, 2026

Restructure tests to match new architecture layers #268

Closed

Add orchestration layer between CLI and workflows #266

Closed

nx10 added the refactor Alterations of code that do not affect function label Apr 3, 2026

nx10 requested review from jpillai00 and kaitj April 3, 2026 21:34

nx10 mentioned this pull request Apr 3, 2026

Fix BIDS label validation for atlas and regressor names #269

Merged

kaitj reviewed Apr 6, 2026


kaitj approved these changes Apr 6, 2026


nx10 merged commit 8942e83 into main Apr 6, 2026
8 checks passed

nx10 deleted the refactor/bids-consolidation branch April 6, 2026 19:09

kaitj mentioned this pull request Apr 7, 2026

Simplify main in cli/all.py #208

Closed

nx10 mentioned this pull request Apr 8, 2026

Allow custom templates and atlases via CLI #238

Closed

nx10 merged 8 commits into main from refactor/bids-consolidation
