feat(lib): section/title inference — multi-pattern cascade, CSI lookup, Jaccard fuzzy match by thewrz · Pull Request #62 · wrzonance/SpecR

thewrz · 2026-05-17T03:40:42Z

Summary

Adds `inferSectionMeta(tree)` in `src/lib/infer-section.ts` — pure, no I/O, multi-pattern cascade: Level 1 (`SECTION \d{2} \d{2} \d{2}` keyword, high confidence) → Level 2 (bare number whole-text match, medium confidence) over first 50 nodes
Adds `computeTitleMatch` word-Jaccard fuzzy matcher (`exact`/`close`/`divergent`/`unknown`) — no new deps
`parse()` dispatcher now runs inference after each parser, updates `tree.section`/`tree.title` when inference fires, returns `ParseResult.sectionInference` always
`LoadResult` gains `inferenceWarnings[]` — after each `persistParsedSpec`, looks up standard title from `csi_sections` table, fuzzy-matches, emits warning with both inferred and standard titles when they diverge
`parse_document` MCP response conditionally includes `sectionInference` object — LLM callers get signal to prompt user when confidence is medium or titles diverge
`load_files` MCP response auto-includes `inferenceWarnings[]` — zero noise for clean SEC files, actionable signal for DOCX files missing `dc:subject`
Security: glob-resolved paths in `load_files` now checked through `guardPath` (path traversal fix)

Test plan

`pnpm test` — 380/380 unit tests pass
`pnpm test:integration` — all integration tests pass
`pnpm lint` — clean
Parse ARCAT DOCX via `parse_document` MCP — `sectionInference` present: `method: content-high`, `confidence: high`, `inferredSection: "26 09 33"`, `inferredTitle: "ARCHITECTURAL LIGHTING CONTROL SYSTEM"`
Parse UFGS SEC via `parse_document` MCP — no `sectionInference` field (clean metadata path)
`load_files` with ARCAT DOCXs — `inferenceWarnings[]` populated with `inferredSection`, `inferredTitle`, `confidence: high`, `note` per file

Summary by CodeRabbit

New Features
- Automatic inference of missing section numbers and titles from document content with confidence and match-quality indicators
- Parsing/upload tools and CLI now return inferred section/title and include inference notes when content-based inference is used
- File loading emits structured inference warnings alongside results
Documentation
- New design/spec, planning doc, and ADR describing inference behavior and reference-data interpretation
Tests
- Expanded unit and integration tests covering inference, parsing, loading, and warning behaviors

…I lookup, fuzzy title match

…/title unknown gap

…y section number

…ch word-Jaccard

…ates tree on content match

…y match per file

…tent inference fires

…SI lookup + loadFiles warnings

…ad_files - Extract handler functions to separate handlers.ts module to reduce tools.ts complexity - Add guardPath checks to glob-resolved paths before resolving - Replace path accumulation with discriminated union PathResolution type - All paths (glob + explicit) now validated against project root boundary - Prevents glob patterns like '../../../../etc/passwd' from escaping sandbox - Unit tests pass, lint passes

coderabbitai · 2026-05-17T03:40:53Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds deterministic CSI section/title inference: core algorithm and tests, DB title lookup and re-export, parser wiring to include/apply inference, loader-level inference warnings, MCP handler extraction/enrichment, path-resolution helpers, and integration tests.

Changes

Section/Title Inference Feature

Layer / File(s)	Summary
Design Documentation `docs/superpowers/plans/2026-05-16-section-inference.md`, `docs/superpowers/specs/2026-05-16-section-inference-design.md`, `docs/adr/012-ufgs-as-reference-not-authoritative-csi.md`	Plan, spec, and ADR define the inference architecture, algorithm, data contracts, integration points, error handling, and test expectations.
Database Query for CSI Titles `src/db/queries/search.ts`, `src/db/index.ts`	Adds `lookupCsiSectionTitle(sectionNumber)` to fetch standard titles from `csi_sections`, wraps DB errors in `DatabaseError`, and re-exports it from the DB index.
Core Inference Engine `src/lib/infer-section.ts`, `src/lib/infer-section.test.ts`	Implements `inferSectionMeta()` (keyword/number scans, inline/following title extraction, method/confidence) and `computeTitleMatch()` (token-overlap scoring with exact/close/divergent/unknown categories). Unit tests cover detection and matching scenarios.
Parser Integration & Tree Updates `src/parser/index.ts`, `src/parser/parse.test.ts`	`parse()` now invokes inference, returns `sectionInference` in `ParseResult`, and conditionally applies inferred `section`/`title` to the tree via `applyInference()`. Tests assert inference outcomes.
File Loader & Inference Warnings `src/lib/file-loader.ts`, `src/lib/file-loader.test.ts`	Refactors per-file processing into `processFile()`, introduces `InferenceWarning` and extends `LoadResult` with `inferenceWarnings`. For content-driven inferences, performs DB lookup and fuzzy matching to assemble warnings (skipped on `dryRun` or metadata-based inferences). Tests assert warning generation and error isolation.
MCP Handler Module & Inference Enrichment `src/mcp/handlers.ts`	New `handlers.ts` implements MCP tools; `handleParseDocument()` decodes inputs, parses, enriches `sectionInference` using DB lookup and match scoring, persists specs, counts nodes, and returns structured JSON responses with optional inference notes.
MCP Tools Path Resolution & Delegation `src/mcp/tools.ts`	Delegates handlers to `./handlers.js` and adds path-resolution helpers (`guardPath`, `resolveGlobPaths()`, `resolveExplicitPaths()`, `collectResolvedPaths()`) to validate/aggregate absolute paths under `projectRoot` before calling `loadFiles()`.
Unit & Integration Tests `src/lib/infer-section.test.ts`, `src/lib/infer-section.integration.test.ts`, `src/lib/file-loader.test.ts`	Adds and updates tests for inference logic, DB lookup, parser behavior, loader warnings, and integration with DOCX/SEC fixtures.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

wrzonance/SpecR#60: Extends the universal loader by adding inferenceWarnings computation during per-file loading.
wrzonance/SpecR#55: Touches the MCP parse_document tool flow that this PR enriches with sectionInference and title matching.
wrzonance/SpecR#21: Adds DOCX parse output used by parser wiring; intersects with src/parser/index.ts changes here.

Sequence Diagram(s)

sequenceDiagram
  participant Client
  participant MCP_Handler
  participant Parser
  participant DB
  participant Persistor
  Client->>MCP_Handler: parse_document(filename, contentBase64)
  MCP_Handler->>Parser: parse(buffer, filename)
  Parser-->>MCP_Handler: ParseResult (tree, sectionInference)
  alt content-driven inference present
    MCP_Handler->>DB: lookupCsiSectionTitle(inferredSection)
    DB-->>MCP_Handler: standardTitle|null
    MCP_Handler->>MCP_Handler: computeTitleMatch(inferredTitle, standardTitle)
  end
  MCP_Handler->>Persistor: persistParsedSpec(tree)
  Persistor-->>MCP_Handler: specId
  MCP_Handler-->>Client: JSON response (specId, section, title, sectionInference?, nodeCount)

"🐰
I hop through lines of text and code,
I sniff the numbers down the road,
I match a title, fuzzy-bright,
I warn when inference takes the flight,
CSI hums softly — metadata glowed."

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title 'feat(lib): section/title inference — multi-pattern cascade, CSI lookup, Jaccard fuzzy match' directly and clearly summarizes the main change: addition of section/title inference with specific implementation details (multi-pattern cascade, CSI lookup, fuzzy matching). It accurately reflects the primary feature being introduced.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/section-inference

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

…nt in CI

coderabbitai

Actionable comments posted: 7

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/mcp/tools.ts (1)

130-157: ⚠️ Potential issue | 🔴 Critical | ⚡ Quick win

guardPath() is still bypassable via symlinks.

path.resolve() + startsWith() only validates the lexical path. A symlink inside projectRoot that points outside the root still passes this check, and loadFiles() will then read the external target. Resolve both the root and each candidate with realpath() (or reject symlinks) before doing the containment check.

🔒 Hardening sketch

-import { glob } from 'node:fs/promises';
+import { glob, realpath } from 'node:fs/promises';
...
-function guardPath(fp: string, projectRoot: string): ToolError | null {
-  const abs = path.resolve(fp);
-  if (!abs.startsWith(projectRoot + path.sep) && abs !== projectRoot) {
+async function guardPath(fp: string, projectRoot: string): Promise<ToolError | null> {
+  const root = await realpath(projectRoot);
+  const abs = await realpath(path.resolve(projectRoot, fp));
+  if (!abs.startsWith(root + path.sep) && abs !== root) {
     return toolError(`path is outside project root: ${fp}`);
   }
   return null;
 }

Then await guardPath() in both resolution helpers before accepting a candidate path.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/mcp/tools.ts` around lines 130 - 157, guardPath currently uses lexical
path.resolve and startsWith which is bypassable via symlinks; make guardPath
async and canonicalize both projectRoot and the candidate using
fs.promises.realpath (or reject symlinks via lstat) and then perform the
containment check against the real projectRoot (ensure you compare with path.sep
similarly). Update resolveGlobPaths to await guardPath for each resolved match
(use the realpath of each match before checking) and update resolveExplicitPaths
to await guardPath for each explicit path and push the canonicalized real paths;
also ensure any callers like loadFiles receive/operate on the real (canonical)
paths rather than the original lexical ones.

🧹 Nitpick comments (5)

src/lib/infer-section.integration.test.ts (1)

17-21: ⚡ Quick win

Simplify null handling in title assertion.

The null coalescing to empty string on line 20 is unnecessary. You can directly assert on the title value for clearer intent.

♻️ Proposed simplification

   it('returns standard title for known CSI section', async () => {
     const title = await lookupCsiSectionTitle('27 10 00');
-    expect(typeof title).toBe('string');
-    expect((title ?? '').length).toBeGreaterThan(0);
+    expect(title).toBeTruthy();
+    expect(title).toMatch(/.+/);
   });

This avoids the awkward coercion and more directly expresses that the title should be a non-empty string.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/lib/infer-section.integration.test.ts` around lines 17 - 21, The test
should directly assert that lookupCsiSectionTitle('27 10 00') returns a
non-empty string: in the test that calls lookupCsiSectionTitle, remove the
null-coalescing (title ?? '') and instead assert on title itself (e.g., keep
expect(typeof title).toBe('string') and add
expect(title.length).toBeGreaterThan(0) or expect(title).toBeTruthy()) so the
intent is clear and null coercion is eliminated.

src/parser/parse.test.ts (1)

60-76: ⚡ Quick win

Add a regression test for partial metadata preservation.

Please add a case where section: 'unknown' but title is already known, and inference finds section only; this should keep the original title.

🧪 Suggested test case

+  it('preserves existing title when section is inferred but title is not', async () => {
+    const partialTree: CsiTree = {
+      id: 'x',
+      section: 'unknown',
+      title: 'Known Title',
+      parts: [{ id: 'n1', type: 'part', text: 'SECTION 26 09 33', children: [], meta: {} }],
+    };
+    vi.mocked(parseSec).mockReturnValue({ tree: partialTree, refs: [] });
+    const result = await parse(Buffer.from(''), 'spec.sec');
+    expect(result.tree.section).toBe('26 09 33');
+    expect(result.tree.title).toBe('Known Title');
+  });

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/parser/parse.test.ts` around lines 60 - 76, Add a new unit test in
parse.test.ts that mirrors the existing "updates tree section and title when
inference fires on unknown section" test but uses a CsiTree with section:
'unknown' and title: 'MOTOR CONTROLLERS' (title already known). Mock parseSec
(vi.mocked(parseSec).mockReturnValue(...)) to return that tree and then call
parse(Buffer.from(''), 'spec.sec'); assert that result.tree.section === '26 09
33' (inferred) and result.tree.title === 'MOTOR CONTROLLERS' (unchanged), and
validate result.sectionInference.method === 'content-high' and
result.sectionInference.confidence === 'high' to ensure only the section was
updated while preserving existing title.

src/lib/file-loader.ts (2)

33-38: ⚡ Quick win

Don’t swallow CSI title lookup failures without context.

At Line 36, the catch block drops the original error entirely, so DB outages look the same as “no matching section.” Capture and log the error context before falling back to null.

Suggested patch

 async function resolveStandardTitle(section: string): Promise<string | null> {
   try {
     return await lookupCsiSectionTitle(section);
-  } catch {
+  } catch (err) {
+    logger.warn({ err, section }, 'lookupCsiSectionTitle failed; continuing without standard title');
     return null;
   }
 }

As per coding guidelines, "Custom error class per module boundary extends SpecrError, use cause chaining at every catch site to preserve context."

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/lib/file-loader.ts` around lines 33 - 38, The catch in
resolveStandardTitle currently swallows lookupCsiSectionTitle failures; create a
module-specific error class (e.g., FileLoaderError) that extends SpecrError and
update the catch to capture the original error (catch (err)), log the error
context, and rethrow a new FileLoaderError with a descriptive message and the
original error as the cause (using cause chaining) so callers preserve failure
context instead of silently returning null.

76-87: ⚡ Quick win

Avoid mutating inferenceWarnings in place.

At Line 86, .push() mutates shared state. Prefer returning the warning from processFile() and aggregating immutably in loadFiles().

Suggested patch

 async function processFile(
   file: string,
-  dryRun: boolean,
-  inferenceWarnings: InferenceWarning[]
-): Promise<void> {
+  dryRun: boolean
+): Promise<InferenceWarning | null> {
   const buffer = await readFile(file);
   const result = await parse(buffer, file);
-  if (dryRun) return;
+  if (dryRun) return null;
   const specId = await persistParsedSpec(result);
-  const warning = await buildInferenceWarning(file, specId, result.sectionInference);
-  if (warning) inferenceWarnings.push(warning);
+  return await buildInferenceWarning(file, specId, result.sectionInference);
 }

-  const inferenceWarnings: InferenceWarning[] = [];
+  let inferenceWarnings: ReadonlyArray<InferenceWarning> = [];
 ...
-      await processFile(file, opts?.dryRun ?? false, inferenceWarnings);
+      const warning = await processFile(file, opts?.dryRun ?? false);
+      if (warning) inferenceWarnings = [...inferenceWarnings, warning];

As per coding guidelines, "Immutable patterns: create new objects, never mutate. Use spread operators, not property assignment."

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/lib/file-loader.ts` around lines 76 - 87, processFile currently mutates
the shared inferenceWarnings array with inferenceWarnings.push; change
processFile(file, dryRun, ...) to return the built warning (or null/undefined)
instead of mutating state: keep readFile, parse, persistParsedSpec and call
buildInferenceWarning(file, specId, result.sectionInference) then return that
value; update the caller loadFiles (or any caller that passed the
inferenceWarnings array) to collect warnings immutably (e.g., await Promise.all
on processFile calls and combine results using filter and array spread/concat
into a new array) so no in-place mutation occurs.

src/mcp/handlers.ts (1)

62-67: ⚡ Quick win

Don't mask CSI title lookup failures as a missing row.

This catch turns every database/query failure into null, which suppresses title-match enrichment and makes outages indistinguishable from "section not found". At minimum, log the exception before degrading so operators can see why inference warnings disappeared.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/mcp/handlers.ts` around lines 62 - 67, The function
resolveStandardTitleForMcp currently swallows all errors from
lookupCsiSectionTitle and returns null; change the catch to capture the thrown
error (e.g., catch (err)) and log the exception before returning null so
operational failures are visible — update the catch block in
resolveStandardTitleForMcp to call the existing logger (or console.error) with a
clear message and the error and then return null; keep the return behavior so
callers are unchanged but failures are now recorded for debugging.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@docs/superpowers/specs/2026-05-16-section-inference-design.md`:
- Around line 64-65: The spec is inconsistent about where fuzzy matching and the
CSI title column live; unify them by specifying that fuzzy-match logic runs in
inferSectionMeta after standardTitle is derived (enrichment step), and update
the DB/query contract to use a single canonical column name (choose either title
or section_title) across the document and examples; change all references in the
spec and DB snippet (and the CSI query contract) to the chosen column name and
add a short note in the inferSectionMeta/enrichment section describing that
fuzzy matching uses standardTitle and writes/queries the canonical column (refer
to inferSectionMeta, standardTitle, enrichment, and the DB snippet/CSI query
contract to locate places to update).

In `@src/db/queries/search.ts`:
- Around line 68-73: The DB result from lookupCsiSectionTitle must be
runtime-validated with Zod instead of relying on the TypeScript generic; add a
Zod schema like z.object({ title: z.string() }), import z from 'zod', and
validate result.rows[0] (or handle missing rows) using safeParse/parse; if
validation succeeds return the validated.title, otherwise handle the failure
(return null or log/throw consistently with existing error handling). Update
lookupCsiSectionTitle to call the validator before returning and ensure any
invalid shape is not returned to callers.

In `@src/lib/infer-section.integration.test.ts`:
- Around line 34-38: The test currently only asserts inferredSection and tree
update when result.sectionInference.confidence !== 'none', which lets a case
where method !== 'none' but confidence === 'none' skip important checks; replace
the conditional skip with explicit assertions (e.g.,
expect(result.sectionInference.confidence).not.toBe('none')) and then always
assert that result.sectionInference.inferredSection matches /^\d{2} \d{2}
\d{2}$/ and that result.tree.section equals
result.sectionInference.inferredSection so both method, confidence and the
inferredSection/tree update are validated (refer to
result.sectionInference.method, result.sectionInference.confidence,
result.sectionInference.inferredSection, and result.tree.section).
- Around line 8-10: The test references fixture constants PROJECT_ROOT,
ARCAT_DOCX, and UFGS_SEC and should not run in CI when those files are missing;
add a sentinel-file gating check (e.g., use fs.existsSync or the existing
describe.skipIf helper) at the top of infer-section.integration.test.ts to
detect whether ARCAT_DOCX (or a designated sentinel file) exists and wrap the
fixture-dependent describe blocks with describe.skipIf(!fixtureExists, ...) so
the tests are skipped when fixtures are absent; update the test file to compute
fixtureExists from ARCAT_DOCX/ UFGS_SEC and use that symbol in the
describe.skipIf call.

In `@src/lib/infer-section.ts`:
- Around line 13-15: The regexes KEYWORD_RE and INLINE_TITLE_RE match "SECTION"
inside other words (e.g., "INTERSECTION"), causing false positives; update both
patterns to require word boundaries around SECTION (use \b before and after
SECTION) so they only match whole-word "SECTION" while preserving the rest of
each pattern and case-insensitive flag.

In `@src/mcp/handlers.ts`:
- Around line 193-198: handleParseDocument currently bypasses the shared
dispatcher and rebuilds inference by calling parseSec/parseDocx then
enrichInferenceForMcp and persistParsedSpec, which causes MCP-specific
divergence and emits sectionInference incorrectly; instead, use the
parser/inference result already provided by the dispatcher (do not call
enrichInferenceForMcp on a fresh parse), feed that shared inference tree into
persistParsedSpec (and only emit sectionInference for non-metadata warning cases
as the dispatcher does), removing the extra parse+enrich path around
parseSec/parseDocx, enrichInferenceForMcp and ensuring handleParseDocument
consumes and persists the dispatcher-owned tree and refs exactly as the rest of
the stack does.

In `@src/parser/index.ts`:
- Around line 23-24: applyInference currently overwrites an existing tree.title
with the inferredTitle value 'unknown'; change the return so title is only
replaced when the inferredTitle is not the sentinel 'unknown' (or when
tree.title is missing). Concretely, inside the function that contains the
snippet (applyInference), compute a new title like: if inference.inferredTitle
=== 'unknown' && tree.title then keep tree.title else use
inference.inferredTitle, and return { ...tree, section:
inference.inferredSection, title: computedTitle } so section can still be
updated while preserving an existing known title.

---

Outside diff comments:
In `@src/mcp/tools.ts`:
- Around line 130-157: guardPath currently uses lexical path.resolve and
startsWith which is bypassable via symlinks; make guardPath async and
canonicalize both projectRoot and the candidate using fs.promises.realpath (or
reject symlinks via lstat) and then perform the containment check against the
real projectRoot (ensure you compare with path.sep similarly). Update
resolveGlobPaths to await guardPath for each resolved match (use the realpath of
each match before checking) and update resolveExplicitPaths to await guardPath
for each explicit path and push the canonicalized real paths; also ensure any
callers like loadFiles receive/operate on the real (canonical) paths rather than
the original lexical ones.

---

Nitpick comments:
In `@src/lib/file-loader.ts`:
- Around line 33-38: The catch in resolveStandardTitle currently swallows
lookupCsiSectionTitle failures; create a module-specific error class (e.g.,
FileLoaderError) that extends SpecrError and update the catch to capture the
original error (catch (err)), log the error context, and rethrow a new
FileLoaderError with a descriptive message and the original error as the cause
(using cause chaining) so callers preserve failure context instead of silently
returning null.
- Around line 76-87: processFile currently mutates the shared inferenceWarnings
array with inferenceWarnings.push; change processFile(file, dryRun, ...) to
return the built warning (or null/undefined) instead of mutating state: keep
readFile, parse, persistParsedSpec and call buildInferenceWarning(file, specId,
result.sectionInference) then return that value; update the caller loadFiles (or
any caller that passed the inferenceWarnings array) to collect warnings
immutably (e.g., await Promise.all on processFile calls and combine results
using filter and array spread/concat into a new array) so no in-place mutation
occurs.

In `@src/lib/infer-section.integration.test.ts`:
- Around line 17-21: The test should directly assert that
lookupCsiSectionTitle('27 10 00') returns a non-empty string: in the test that
calls lookupCsiSectionTitle, remove the null-coalescing (title ?? '') and
instead assert on title itself (e.g., keep expect(typeof title).toBe('string')
and add expect(title.length).toBeGreaterThan(0) or expect(title).toBeTruthy())
so the intent is clear and null coercion is eliminated.

In `@src/mcp/handlers.ts`:
- Around line 62-67: The function resolveStandardTitleForMcp currently swallows
all errors from lookupCsiSectionTitle and returns null; change the catch to
capture the thrown error (e.g., catch (err)) and log the exception before
returning null so operational failures are visible — update the catch block in
resolveStandardTitleForMcp to call the existing logger (or console.error) with a
clear message and the error and then return null; keep the return behavior so
callers are unchanged but failures are now recorded for debugging.

In `@src/parser/parse.test.ts`:
- Around line 60-76: Add a new unit test in parse.test.ts that mirrors the
existing "updates tree section and title when inference fires on unknown
section" test but uses a CsiTree with section: 'unknown' and title: 'MOTOR
CONTROLLERS' (title already known). Mock parseSec
(vi.mocked(parseSec).mockReturnValue(...)) to return that tree and then call
parse(Buffer.from(''), 'spec.sec'); assert that result.tree.section === '26 09
33' (inferred) and result.tree.title === 'MOTOR CONTROLLERS' (unchanged), and
validate result.sectionInference.method === 'content-high' and
result.sectionInference.confidence === 'high' to ensure only the section was
updated while preserving existing title.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro Plus

Run ID: 4b606dba-f97a-4653-ba28-5a0f85d6eefc

📥 Commits

Reviewing files that changed from the base of the PR and between d708f2f and 22780bb.

📒 Files selected for processing (13)

docs/superpowers/plans/2026-05-16-section-inference.md
docs/superpowers/specs/2026-05-16-section-inference-design.md
src/db/index.ts
src/db/queries/search.ts
src/lib/file-loader.test.ts
src/lib/file-loader.ts
src/lib/infer-section.integration.test.ts
src/lib/infer-section.test.ts
src/lib/infer-section.ts
src/mcp/handlers.ts
src/mcp/tools.ts
src/parser/index.ts
src/parser/parse.test.ts

…update inference note strings

… async realpath guardPath, spec doc fixes

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/lib/infer-section.ts`:
- Around line 144-145: The empty catch should be changed to capture the error
and log it with cause-chaining before returning the defensive NONE_RESULT;
update the catch to "catch (err)" in the inferSection function (or the
surrounding function where NONE_RESULT is returned), call the module's logger
(e.g., logger or processLogger) to log a clear message like "Failed to infer
section" and pass an Error constructed with the original error as its cause (new
Error('Failed to infer section', { cause: err })) or include { cause: err } in
the log call, then return NONE_RESULT.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro Plus

Run ID: eb60a74e-40ce-4ac8-b0c2-0719d0ccd980

📥 Commits

Reviewing files that changed from the base of the PR and between 90e6014 and f40d094.

📒 Files selected for processing (5)

docs/superpowers/specs/2026-05-16-section-inference-design.md
src/lib/infer-section.integration.test.ts
src/lib/infer-section.ts
src/mcp/tools.ts
src/parser/index.ts

✅ Files skipped from review due to trivial changes (1)

docs/superpowers/specs/2026-05-16-section-inference-design.md

🚧 Files skipped from review as they are similar to previous changes (3)

src/lib/infer-section.integration.test.ts
src/mcp/tools.ts
src/parser/index.ts

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/mcp/tools.ts`:
- Around line 209-210: Don't use process.cwd() as the trust boundary; instead
resolve relative paths from the MCP-configured project root and pass that into
collectResolvedPaths. Replace the use of process.cwd() when building projectRoot
with the authoritative project root value provided by the MCP context/config
(e.g., use the existing projectRoot/config.projectRoot/serverContext.projectRoot
or add a projectRoot parameter to the caller) so that
collectResolvedPaths(globPattern, explicitPaths, projectRoot) operates against
the correct repository tree rather than the launch directory.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro Plus

Run ID: 4549c04b-1e1c-456d-9837-10c63c410557

📥 Commits

Reviewing files that changed from the base of the PR and between f40d094 and 70a2b24.

📒 Files selected for processing (1)

src/mcp/tools.ts

…m root edge case

… infer-section

thewrz added 10 commits May 16, 2026 20:40

docs: section/title inference design spec — multi-pattern cascade, CS…

6de005b

…I lookup, fuzzy title match

docs: section inference implementation plan — 6 tasks, closes section…

58ceaf8

…/title unknown gap

feat(db): add lookupCsiSectionTitle — CSI ground-truth title lookup b…

ab9a3d6

…y section number

feat(lib): inferSectionMeta — multi-pattern cascade + computeTitleMat…

d43ec50

…ch word-Jaccard

fix(lib): correct title search window to +10 nodes, add boundary tests

6f9c519

feat(parser): wire inferSectionMeta into parse() — applyInference upd…

13f3ed3

…ates tree on content match

feat(lib): InferenceWarning in LoadResult — CSI lookup + Jaccard fuzz…

e3afe0c

…y match per file

feat(mcp): parse_document response includes sectionInference when con…

a7ea6e3

…tent inference fires

test(lib): infer-section integration tests — ARCAT DOCX inference + C…

5400860

…SI lookup + loadFiles warnings

test(lib): gate ARCAT fixture tests on existsSync — fixture not prese…

d93a801

…nt in CI

coderabbitai Bot reviewed May 17, 2026

View reviewed changes

thewrz added 2 commits May 16, 2026 20:52

docs(adr): ADR-012 — UFGS is reference corpus not authoritative CSI; …

90e6014

…update inference note strings

fix(lib,mcp,parser): word boundary regex, applyInference title guard,…

f40d094

… async realpath guardPath, spec doc fixes

coderabbitai Bot reviewed May 17, 2026

View reviewed changes

Comment thread src/lib/infer-section.ts

fix(mcp): add error logging to guardPath catch blocks for observability

70a2b24

coderabbitai Bot reviewed May 17, 2026

View reviewed changes

Comment thread src/mcp/tools.ts

Comment thread src/mcp/tools.ts

fix(mcp): use path.relative for containment check — handles filesyste…

a903e6c

…m root edge case

thewrz commented May 17, 2026

View reviewed changes

Comment thread src/lib/infer-section.ts

refactor(lib): extract TITLE_MIN_LENGTH/TITLE_MAX_LENGTH constants in…

67cbef5

… infer-section

thewrz merged commit 0bf9564 into main May 17, 2026
6 checks passed

thewrz deleted the feat/section-inference branch May 17, 2026 04:25

Conversation

thewrz commented May 17, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Sequence Diagram(s)

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

thewrz commented May 17, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 17, 2026 •

edited

Loading