fix: support reasoning compatible endpoints (Fixes #3279) by deepujain · Pull Request #3286 · NVIDIA/NemoClaw

deepujain · 2026-05-08T20:31:31Z

Summary

Custom OpenAI-compatible providers can now opt into reasoning-mode validation through NEMOCLAW_REASONING. When that flag is enabled, NemoClaw skips the Responses/tool-call probe that reasoning-only wrappers often reject, and the sandbox smoke check accepts reasoning text as valid output.

Fixes #3279.

Changes

Normalized NEMOCLAW_REASONING aliases such as yes, 1, no, and 0.
Preserved the current non-reasoning default when the variable is unset.
Skipped Responses/tool-call and streaming probes for reasoning-mode compatible endpoints.
Increased the sandbox smoke response budget and accepted reasoning_content or reasoning when message.content is empty.
Added focused tests for the flag normalization and custom endpoint reasoning path.

Testing

npm run build:cli passed.
npm test -- test/onboard.test.ts -t "normalizes compatible-endpoint reasoning" passed.
npm test -- test/onboard-selection.test.ts -t "honors NEMOCLAW_REASONING" passed.
HOME=/private/tmp/nemoclaw-test-home-3279b npm test -- --reporter=dot was attempted. The local full suite still fails in existing installer/preinstall tests, including test/install-preflight.test.ts and test/preinstall-node-version.test.ts.

Evidence it works

The focused custom endpoint test simulates a reasoning-only chat response with empty content and non-empty reasoning_content; with NEMOCLAW_REASONING=yes, onboarding accepts the provider and keeps preferredInferenceApi on chat completions.

Summary by CodeRabbit

New Features
- Added a reasoning mode for OpenAI-compatible endpoints, normalized from common aliases (yes/1/no/0) and persisted through onboarding/resume; defaults to off.
- Endpoint validation adapts to reasoning mode (alters probe behavior, toggles streaming) and accepts multiple response formats; sandbox probe uses a larger token budget.
Tests
- New tests cover reasoning-mode normalization and onboarding probe/selection behavior when reasoning is enabled.

Fixes NVIDIA#3279 Signed-off-by: Deepak Jain <deepujain@gmail.com>

copy-pr-bot · 2026-05-08T20:31:34Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

coderabbitai · 2026-05-08T20:31:43Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: c16a7f6e-4e80-44dd-9ec2-be4ea944bccd

📥 Commits

Reviewing files that changed from the base of the PR and between 2c1672b and 747848c.

📒 Files selected for processing (1)

src/lib/state/onboard-session.ts

📝 Walkthrough

Walkthrough

Adds normalization and configuration helpers for a compatible-endpoint reasoning mode, persists the flag in onboarding session state, applies it during provider selection/resume, conditions custom OpenAI-like endpoint validation and smoke probes on the flag (including larger token budget and reasoning-field fallbacks), and adds unit/integration tests.

Changes

Reasoning mode support for OpenAI-compatible endpoints

Layer / File(s)	Summary
Reasoning helpers `src/lib/onboard.ts`	Adds `normalizeReasoningFlag()` and `configureCompatibleEndpointReasoning()` and exports them; they normalize common aliases and mirror the resolved value into `process.env.NEMOCLAW_REASONING`.
Session persistence `src/lib/state/onboard-session.ts`	Adds `compatibleEndpointReasoning: string
Provider selection & probe wiring `src/lib/onboard.ts`	Calls `configureCompatibleEndpointReasoning()` during custom endpoint setup and resume, stores/clears the flag in setup flows, includes it in toSessionUpdates(), and modifies `validateCustomOpenAiLikeSelection()` to branch probe options on `NEMOCLAW_REASONING === "true"` (tool-calling, responses probe, streaming).
Smoke test `src/lib/onboard.ts`	Increases smoke probe `max_tokens` to 512 and accepts response text from `message.content` with fallbacks to `message.reasoning_content` and `message.reasoning`.
Helper exports `src/lib/onboard.ts`	Exports `normalizeReasoningFlag` and `configureCompatibleEndpointReasoning` via `module.exports`.
Unit tests `test/onboard.test.ts`	Extends `OnboardTestInternals` to include the two helpers and adds a unit test for alias normalization and environment config behavior.
Integration test `test/onboard-selection.test.ts`	Adds a test that sets `NEMOCLAW_REASONING=yes` with a custom endpoint, verifies the reasoning-model selection, that validation uses `/chat/completions` (not `/responses`) and does not use streaming, and that the reasoning-mode onboarding prompt is suppressed.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

🐰 I nudge the flag from yes to true,
I hop where reasoning fields accrue,
Custom endpoints hear the call,
content or reasoning — we accept them all,
A little normalization, that's my cue.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 40.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and specifically summarizes the main change: adding support for reasoning-mode compatible endpoints, directly addressing the primary fix objective.
Linked Issues check	✅ Passed	The PR fully implements the requirements from `#3279`: NEMOCLAW_REASONING is now configurable for Option 3 providers, reasoning responses are correctly extracted, and reasoning-mode probes are properly skipped.
Out of Scope Changes check	✅ Passed	All changes are directly scoped to implementing reasoning-mode support for compatible endpoints; no unrelated modifications are present in onboard.ts, test files, or session state.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

Generate code and open pull requests
Plan features and break down work
Investigate incidents and troubleshoot customer tickets together
Automate recurring tasks and respond to alerts with triggers
Summarize progress and report instantly

Built for teams:

Shared memory across your entire org—no repeating context
Per-thread sandboxes to safely plan and execute work
Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

test/onboard-selection.test.ts (1)

2075-2094: ⚡ Quick win

Assert probe path behavior explicitly in reasoning mode.

Line [2077] and Line [2150] currently validate end-state only; this test can still pass if /responses (or streaming probe) is called and then fallback succeeds. Please log curl args and assert reasoning mode avoids /responses/-N while still hitting /chat/completions.

Suggested test hardening

@@
     const scriptPath = path.join(tmpDir, "custom-openai-reasoning-check.js");
+    const curlArgsLog = path.join(tmpDir, "custom-openai-reasoning-curl-args.log");
@@
       path.join(fakeBin, "curl"),
       `#!/usr/bin/env bash
+args_log=${JSON.stringify(curlArgsLog)}
+printf '%s\\n' "$*" >> "$args_log"
 body='{"error":{"message":"bad request"}}'
 status="400"
 outfile=""
 url=""
@@
     assert.equal(payload.result.model, "reasoning-model");
     assert.equal(payload.result.preferredInferenceApi, "openai-completions");
     assert.equal(payload.reasoning, "true");
+    const curlInvocations = fs.readFileSync(curlArgsLog, "utf-8");
+    assert.match(curlInvocations, /chat\/completions/);
+    assert.doesNotMatch(curlInvocations, /\/responses/);
+    assert.doesNotMatch(curlInvocations, /(^|\s)-N(\s|$)/);
     assert.ok(
       payload.messages.every(
         (message: string) => !/Enable reasoning mode for this model/.test(message),
       ),
     );

Also applies to: 2150-2160

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/onboard-selection.test.ts` around lines 2075 - 2094, The fake curl
script written to path.join(fakeBin, "curl") should record its received
arguments so the test can assert probe behavior: modify the heredoc to append
the full "$@" or each processed arg into a temp log file (e.g.,
"$outfile.args.log" or a known capture file in the test dir) before handling
-o/url logic, and keep returning the same body/status logic; then update the
test assertions to read that captured args log and explicitly assert that in
reasoning mode the captured args do not include '/responses' nor the '-N' flag
and do include a call to '/chat/completions'. Ensure you reference the same
fakeBin/curl generation code and the test's reasoning-mode invocation when
adding the new assertions.

test/onboard.test.ts (1)

391-410: ⚡ Quick win

Add an explicit assertion for the unset/default reasoning path.

This test validates aliases, but not the key default behavior when NEMOCLAW_REASONING is unset. Locking that in will better guard the PR’s intended fallback semantics.

✅ Suggested test addition

   expect(normalizeReasoningFlag("maybe")).toBeNull();

+  delete process.env.NEMOCLAW_REASONING;
+  await expect(configureCompatibleEndpointReasoning()).resolves.toBe("false");
+  expect(process.env.NEMOCLAW_REASONING).toBe("false");
+
   process.env.NEMOCLAW_REASONING = "yes";
   await expect(configureCompatibleEndpointReasoning()).resolves.toBe("true");
   expect(process.env.NEMOCLAW_REASONING).toBe("true");

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/onboard.test.ts` around lines 391 - 410, Add an explicit assertion that
the "unset/default" path is covered: before setting NEMOCLAW_REASONING to "yes",
delete process.env.NEMOCLAW_REASONING (or set it to undefined) and call await
expect(configureCompatibleEndpointReasoning()).resolves.toBe(normalizeReasoningFlag(undefined));
this ensures the test asserts the function's fallback behavior when
NEMOCLAW_REASONING is not present and references the existing
normalizeReasoningFlag and configureCompatibleEndpointReasoning symbols.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/lib/onboard.ts`:
- Around line 1508-1511: The configureCompatibleEndpointReasoning function
currently stores the normalized NEMOCLAW_REASONING in process.env only, causing
stale or missing values across onboarding flows; update it to persist the
normalized value into the selected provider state object (use the same
normalization via normalizeReasoningFlag) instead of relying solely on
process.env, and ensure that when provider selection changes the stored flag is
cleared from that provider state and process.env is restored/updated
accordingly; locate configureCompatibleEndpointReasoning and related calls to
normalizeReasoningFlag and modify the provider selection/restore logic where
provider state is mutated so the flag is saved with the provider, removed on
selection change, and used to seed process.env when a provider is active (also
apply the same change pattern to the other occurrence referenced near the file's
later provider-state handling).

---

Nitpick comments:
In `@test/onboard-selection.test.ts`:
- Around line 2075-2094: The fake curl script written to path.join(fakeBin,
"curl") should record its received arguments so the test can assert probe
behavior: modify the heredoc to append the full "$@" or each processed arg into
a temp log file (e.g., "$outfile.args.log" or a known capture file in the test
dir) before handling -o/url logic, and keep returning the same body/status
logic; then update the test assertions to read that captured args log and
explicitly assert that in reasoning mode the captured args do not include
'/responses' nor the '-N' flag and do include a call to '/chat/completions'.
Ensure you reference the same fakeBin/curl generation code and the test's
reasoning-mode invocation when adding the new assertions.

In `@test/onboard.test.ts`:
- Around line 391-410: Add an explicit assertion that the "unset/default" path
is covered: before setting NEMOCLAW_REASONING to "yes", delete
process.env.NEMOCLAW_REASONING (or set it to undefined) and call await
expect(configureCompatibleEndpointReasoning()).resolves.toBe(normalizeReasoningFlag(undefined));
this ensures the test asserts the function's fallback behavior when
NEMOCLAW_REASONING is not present and references the existing
normalizeReasoningFlag and configureCompatibleEndpointReasoning symbols.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 78ba4585-9297-4c89-b6ae-5ad64ecf87cb

📥 Commits

Reviewing files that changed from the base of the PR and between f1568f6 and 077cb4b.

📒 Files selected for processing (3)

src/lib/onboard.ts
test/onboard-selection.test.ts
test/onboard.test.ts

Signed-off-by: Test User <test@example.com>

coderabbitai

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/lib/state/onboard-session.ts (1)

707-717: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Persist compatibleEndpointReasoning in the safe update whitelist.

SessionUpdates adds this field (Line 131), but filterSafeUpdates() never forwards it, so updates passed via markStepComplete() / completeSession() are silently dropped. That breaks both setting and clearing (null) this value during provider transitions.

Suggested fix

 export function filterSafeUpdates(updates: SessionUpdates): Partial<Session> {
   const safe: Partial<Session> = {};
   if (!isObject(updates)) return safe;
   if (typeof updates.sandboxName === "string") safe.sandboxName = updates.sandboxName;
   if (typeof updates.provider === "string") safe.provider = updates.provider;
   if (typeof updates.model === "string") safe.model = updates.model;
   if (typeof updates.endpointUrl === "string") safe.endpointUrl = redactUrl(updates.endpointUrl);
   if (typeof updates.credentialEnv === "string") safe.credentialEnv = updates.credentialEnv;
   if (typeof updates.preferredInferenceApi === "string")
     safe.preferredInferenceApi = updates.preferredInferenceApi;
+  if (typeof updates.compatibleEndpointReasoning === "string") {
+    safe.compatibleEndpointReasoning = updates.compatibleEndpointReasoning;
+  } else if (updates.compatibleEndpointReasoning === null) {
+    safe.compatibleEndpointReasoning = null;
+  }
   if (typeof updates.nimContainer === "string") safe.nimContainer = updates.nimContainer;

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/lib/state/onboard-session.ts` around lines 707 - 717, filterSafeUpdates
currently omits the SessionUpdates field compatibleEndpointReasoning so updates
from markStepComplete/completeSession are dropped; update the function
filterSafeUpdates to accept and propagate compatibleEndpointReasoning (including
allowing null to clear it) into the returned Partial<Session> when typeof
updates.compatibleEndpointReasoning is "string" or when
updates.compatibleEndpointReasoning === null, mirroring how other
nullable/optional fields are handled so provider transitions can set and clear
this value.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Outside diff comments:
In `@src/lib/state/onboard-session.ts`:
- Around line 707-717: filterSafeUpdates currently omits the SessionUpdates
field compatibleEndpointReasoning so updates from
markStepComplete/completeSession are dropped; update the function
filterSafeUpdates to accept and propagate compatibleEndpointReasoning (including
allowing null to clear it) into the returned Partial<Session> when typeof
updates.compatibleEndpointReasoning is "string" or when
updates.compatibleEndpointReasoning === null, mirroring how other
nullable/optional fields are handled so provider transitions can set and clear
this value.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 06d7e88a-98bf-4a0d-89a9-ddd0edb6c09a

📥 Commits

Reviewing files that changed from the base of the PR and between 077cb4b and 66bbe29.

📒 Files selected for processing (4)

src/lib/onboard.ts
src/lib/state/onboard-session.ts
test/onboard-selection.test.ts
test/onboard.test.ts

🚧 Files skipped from review as they are similar to previous changes (3)

test/onboard-selection.test.ts
test/onboard.test.ts
src/lib/onboard.ts

Signed-off-by: Test User <test@example.com>

coderabbitai

🧹 Nitpick comments (1)

src/lib/state/onboard-session.ts (1)

145-166: ⚡ Quick win

Include compatibleEndpointReasoning in debug summary output.

The new persisted field is not exposed by DebugSessionSummary/summarizeForDebug, which makes reasoning-mode resume issues harder to diagnose from debug dumps.

Proposed patch

 export interface DebugSessionSummary {
   version: number;
   sessionId: string;
   status: string;
   resumable: boolean;
   mode: string;
   startedAt: string;
   updatedAt: string;
   sandboxName: string | null;
   provider: string | null;
   model: string | null;
   endpointUrl: string | null;
   credentialEnv: string | null;
   preferredInferenceApi: string | null;
+  compatibleEndpointReasoning: string | null;
   nimContainer: string | null;
   policyPresets: string[] | null;
   gpuPassthrough: boolean;
   lastStepStarted: string | null;
   lastCompletedStep: string | null;
   failure: SessionFailure | null;
   steps: Record<string, StepState>;
 }
@@
     endpointUrl: redactUrl(session.endpointUrl),
     credentialEnv: session.credentialEnv,
     preferredInferenceApi: session.preferredInferenceApi,
+    compatibleEndpointReasoning: session.compatibleEndpointReasoning,
     nimContainer: session.nimContainer,
     policyPresets: session.policyPresets,

Also applies to: 852-867

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/lib/state/onboard-session.ts` around lines 145 - 166, DebugSessionSummary
and the summarizeForDebug output are missing the new persisted field
compatibleEndpointReasoning, so add compatibleEndpointReasoning: string | null
(matching the persisted type) to the DebugSessionSummary interface and update
the summarizeForDebug function to read the session-compatibleEndpointReasoning
value and include it in the returned summary object (alongside fields like
failure, steps, nimContainer, preferredInferenceApi, etc.) so debug dumps expose
reasoning-mode compatibility; update any related spots mentioned (e.g., the
summarizeForDebug implementation around the lastStep/lastCompletedStep logic) to
populate this field.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@src/lib/state/onboard-session.ts`:
- Around line 145-166: DebugSessionSummary and the summarizeForDebug output are
missing the new persisted field compatibleEndpointReasoning, so add
compatibleEndpointReasoning: string | null (matching the persisted type) to the
DebugSessionSummary interface and update the summarizeForDebug function to read
the session-compatibleEndpointReasoning value and include it in the returned
summary object (alongside fields like failure, steps, nimContainer,
preferredInferenceApi, etc.) so debug dumps expose reasoning-mode compatibility;
update any related spots mentioned (e.g., the summarizeForDebug implementation
around the lastStep/lastCompletedStep logic) to populate this field.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: f8e80300-8c3d-41f3-be29-8da2e7314cd8

📥 Commits

Reviewing files that changed from the base of the PR and between 66bbe29 and 2c1672b.

📒 Files selected for processing (1)

src/lib/state/onboard-session.ts

Signed-off-by: Test User <test@example.com>

wscurran · 2026-05-11T14:39:58Z

✨ Thanks for submitting this detailed PR about supporting reasoning-compatible endpoints for custom OpenAI providers. This change aims to improve the compatibility of NemoClaw with reasoning-mode validation through the NEMOCLAW_REASONING flag.

Related open issues:

#3279 NEMOCLAW_REASONING not configurable for Option 3 providers; reasoning-only models fail silently

fix(onboard): support reasoning compatible endpoints

077cb4b

Fixes NVIDIA#3279 Signed-off-by: Deepak Jain <deepujain@gmail.com>

coderabbitai Bot reviewed May 8, 2026

View reviewed changes

Comment thread src/lib/onboard.ts Outdated

fix: persist compatible endpoint reasoning state

66bbe29

Signed-off-by: Test User <test@example.com>

coderabbitai Bot reviewed May 8, 2026

View reviewed changes

fix: persist reasoning state through session updates

2c1672b

Signed-off-by: Test User <test@example.com>

coderabbitai Bot reviewed May 8, 2026

View reviewed changes

fix: expose reasoning mode in debug summary

747848c

Signed-off-by: Test User <test@example.com>

ericksoa added the v0.0.39 Release target label May 9, 2026

wscurran added enhancement: provider Use this label to identify requests to add a new AI provider to NemoClaw. Provider: OpenAI Use this label to identify issues with the OpenAI provider integration. fix and removed v0.0.39 Release target labels May 11, 2026

cv closed this May 12, 2026

cv reopened this May 12, 2026

wscurran added enhancement: inference Items related to running (local or hosted) inference models from NemoClaw. status: rebase PR needs to be rebased against main before review can continue labels May 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: support reasoning compatible endpoints (Fixes #3279)#3286

fix: support reasoning compatible endpoints (Fixes #3279)#3286
deepujain wants to merge 4 commits into
NVIDIA:mainfrom
deepujain:fix/3279-compatible-reasoning-flag

deepujain commented May 8, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

copy-pr-bot Bot commented May 8, 2026

Uh oh!

coderabbitai Bot commented May 8, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot left a comment

Uh oh!

wscurran commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

deepujain commented May 8, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Testing

Evidence it works

Summary by CodeRabbit

Uh oh!

copy-pr-bot Bot commented May 8, 2026

Uh oh!

coderabbitai Bot commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

wscurran commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

deepujain commented May 8, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 8, 2026 •

edited

Loading