fix(cli): prompt before reusing existing sandbox on re-onboard by paritoshd-nv · Pull Request #1527 · NVIDIA/NemoClaw

paritoshd-nv · 2026-04-06T15:01:14Z

Add --recreate-sandbox CLI flag and interactive confirmation prompt when a sandbox with the same name already exists during onboarding. In non-interactive mode, ready sandboxes are reused with a hint about the new flag; non-ready sandboxes now exit with an error instead of silently recreating.

Closes #1102

Summary

Related Issue

Changes

Type of Change

Code change for a new feature, bug fix, or refactor.
Code change with doc updates.
Doc only. Prose changes without code sample modifications.
Doc only. Includes code sample changes.

Testing

npx prek run --all-files passes (or equivalently make check).
npm test passes.
make docs builds without warnings. (for doc-only changes)

Checklist

General

I have read and followed the contributing guide.
I have read and followed the style guide. (for doc-only changes)

Code Changes

Formatters applied — npx prek run --all-files auto-fixes formatting (or make format for targeted runs).
Tests added or updated for new or changed behavior.
No secrets, API keys, or credentials committed.
Doc pages updated for any user-facing behavior changes (new commands, changed defaults, new features, bug fixes that contradict existing docs).

Doc Changes

Follows the style guide. Try running the update-docs agent skill to draft changes while complying with the style guide. For example, prompt your agent with "/update-docs catch up the docs for the new changes I made in this PR."
New pages include SPDX license header and frontmatter, if creating a new page.
Cross-references and links verified.

Signed-off-by: Your Name your-email@example.com

Summary by CodeRabbit

New Features
- Added --recreate-sandbox CLI flag and NEMOCLAW_RECREATE_SANDBOX env var to force sandbox recreation; interactive prompt to confirm reusing an existing sandbox (defaults to reuse).
Improvements
- Non-interactive mode now fails if an existing sandbox is not ready, with guidance to use the recreate option.
- Provider-migration always deletes-and-recreates and logs a clear notice before deletion.
Tests
- Added tests for interactive and non-interactive sandbox lifecycle behaviors.

coderabbitai · 2026-04-06T15:01:27Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds a --recreate-sandbox flag / RECREATE_SANDBOX env toggle and an isRecreateSandbox() helper; gates sandbox reuse on that toggle and provider-migration, prompts interactively to reuse, errors in non-interactive stale states, and always deletes/recreates when provider migration is needed. Logs deletion note before recreation.

Changes

Cohort / File(s)	Summary
Onboarding Logic `bin/lib/onboard.js`	Add `RECREATE_SANDBOX` toggle and `isRecreateSandbox()`; change reuse gate to `!isRecreateSandbox() && !needsProviderMigration`; interactive prompt to reuse; non-interactive stale-state exits with hint to use `--recreate-sandbox`/env; always delete-and-recreate when `needsProviderMigration`; add deletion note log.
CLI Flag Addition `bin/nemoclaw.js`	Document and accept `--recreate-sandbox` in usage and allowed args; parse `recreateSandbox` and pass into `runOnboard()`; update unknown-option validation.
Tests `test/onboard.test.js`	Add four tests: non-interactive stale-state exits, env-forced recreate triggers delete+create, interactive reuse (yes) avoids delete/create, interactive decline (no) triggers delete+create; assertions on prompts, runner/spawn calls, and messages.

Sequence Diagram

sequenceDiagram
    participant User as User/CLI
    participant CLI as nemoclaw.js
    participant Onboard as onboard.js
    participant Sandbox as Sandbox System

    User->>CLI: Run `nemoclaw onboard` (maybe --recreate-sandbox)
    CLI->>CLI: Parse args → {recreateSandbox, nonInteractive, ...}
    CLI->>Onboard: runOnboard(opts)
    Onboard->>Onboard: Determine isRecreateSandbox() from opts/env
    Onboard->>Sandbox: Check existing sandbox state & needsProviderMigration

    alt isRecreateSandbox() OR needsProviderMigration
        Onboard->>User: note "Deleting and recreating sandbox '${sandboxName}'..."
        Onboard->>Sandbox: Delete existing sandbox
        Onboard->>Sandbox: Create sandbox
    else !isRecreateSandbox() AND !needsProviderMigration
        alt Interactive
            Onboard->>User: Prompt "Reuse existing sandbox? [Y/n]"
            alt User = n
                Onboard->>User: note "Deleting and recreating sandbox '${sandboxName}'..."
                Onboard->>Sandbox: Delete and Create
            else User = y
                Onboard->>Sandbox: Upsert messaging providers
                Onboard->>Sandbox: Forward dashboard
                Onboard-->>User: Return sandbox name
            end
        else Non-Interactive
            alt Sandbox state == ready
                Onboard->>Sandbox: Upsert messaging providers
                Onboard->>Sandbox: Forward dashboard
                Onboard-->>User: Return sandbox name
            else Sandbox not ready
                Onboard->>User: Print error & hint about --recreate-sandbox
                Onboard->>Onboard: process.exit(1)
            end
        end
    end

Estimated Code Review Effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 I nudged a sandbox, old and new,
A flag to remake, a question or two,
"Reuse or recreate?" — I tap and I ask,
No silent overwrite, now there's a clear task,
I log, I prompt, then hop off to my stew.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 42.86% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: adding a prompt before reusing an existing sandbox during re-onboard, which directly addresses the silent overwrite issue.
Linked Issues check	✅ Passed	All coding requirements from issue `#1102` are met: detect existing sandboxes, prompt before reuse/recreation in interactive mode, require explicit confirmation via --recreate-sandbox flag, and reject non-ready sandboxes in non-interactive mode.
Out of Scope Changes check	✅ Passed	All changes are directly scoped to implementing the --recreate-sandbox feature and interactive/non-interactive sandbox reuse logic as required by issue `#1102`.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (1)

bin/lib/onboard.js (1)
180-188: Minor redundancy in isRecreateSandbox() env check.

At line 3647, RECREATE_SANDBOX is set from opts.recreateSandbox || process.env.NEMOCLAW_RECREATE_SANDBOX === "1", so the env var check in isRecreateSandbox() is redundant—if the env is "1", RECREATE_SANDBOX is already true.

The pattern matches isNonInteractive(), so this is consistent within the file. Consider simplifying if you want to reduce duplication, but it's not harmful as-is.
🔧 Optional simplification
 function isRecreateSandbox() {
-  return RECREATE_SANDBOX || process.env.NEMOCLAW_RECREATE_SANDBOX === "1";
+  return RECREATE_SANDBOX;
 }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@bin/lib/onboard.js` around lines 180 - 188, The isRecreateSandbox() helper
redundantly re-checks process.env.NEMOCLAW_RECREATE_SANDBOX === "1" even though
RECREATE_SANDBOX is already initialized from opts.recreateSandbox ||
process.env.NEMOCLAW_RECREATE_SANDBOX === "1"; update isRecreateSandbox() to
simply return RECREATE_SANDBOX (remove the duplicate env lookup) so the function
mirrors isNonInteractive()’s simplicity and relies on the single source of truth
set when RECREATE_SANDBOX is initialized.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@bin/lib/onboard.js`:
- Around line 180-188: The isRecreateSandbox() helper redundantly re-checks
process.env.NEMOCLAW_RECREATE_SANDBOX === "1" even though RECREATE_SANDBOX is
already initialized from opts.recreateSandbox ||
process.env.NEMOCLAW_RECREATE_SANDBOX === "1"; update isRecreateSandbox() to
simply return RECREATE_SANDBOX (remove the duplicate env lookup) so the function
mirrors isNonInteractive()’s simplicity and relies on the single source of truth
set when RECREATE_SANDBOX is initialized.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: cf0f4bda-ad50-47c8-883d-a2c0a79c08a6

📥 Commits

Reviewing files that changed from the base of the PR and between 364969d and 3a53392.

📒 Files selected for processing (2)

bin/lib/onboard.js
bin/nemoclaw.js

wscurran · 2026-04-06T16:43:06Z

✨ Thanks for submitting this fix, which proposes a way to prompt before reusing an existing sandbox during re-onboarding, improving safety and clarity in the setup flow.

ericksoa

Thanks for tackling this, @paritoshd-nv — the non-interactive error-on-non-ready behavior is a solid improvement over the silent overwrite. A few things to consider:

Interactive path and sandbox state

The non-interactive path correctly distinguishes ready vs not-ready, but the interactive prompt offers "Reuse existing sandbox?" regardless of state. What happens if the user says "yes" to reuse a sandbox that isn't ready? Would it be worth either skipping the prompt (go straight to recreation) or surfacing the state so the user can make an informed choice?

Input parsing

The check answer.trim().toLowerCase() !== "n" means typing "no" is treated as reuse. Would .startsWith("n") be safer here?

Unrelated change in isNonInteractive()

The added process.env.NEMOCLAW_NON_INTERACTIVE === "1" check is already handled at the top of onboard() where NON_INTERACTIVE is set. Could you revert this to keep the PR scoped to the sandbox prompt change?

Tests

Could you add unit tests for the new branching logic? Specifically the interactive ready/not-ready paths and the non-interactive error exit.

paritoshd-nv · 2026-04-06T17:34:06Z

The added process.env.NEMOCLAW_NON_INTERACTIVE === "1" check is already handled at the top of onboard() where NON_INTERACTIVE is set. Could you revert this to keep the PR scoped to the sandbox prompt change?

The tests call createSandbox() directly without going through onboard(), so NON_INTERACTIVE stays false.
The test spawns a subprocess with NEMOCLAW_NON_INTERACTIVE: "1" in the env. But the script calls createSandbox() directly — not onboard() — so NON_INTERACTIVE is never set from the env var. Without the fallback in isNonInteractive(), the code will take the interactive path and hit the prompt, hanging the test.

coderabbitai

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@bin/lib/onboard.js`:
- Around line 2009-2011: Update the non-interactive reuse message to include a
recreate hint: when the branch that calls note(`  [non-interactive] Sandbox
'${sandboxName}' exists and is ready — reusing it`) executes (the same place
that calls ensureDashboardForward(sandboxName, chatUiUrl) and returns
sandboxName), append or alter the note text to mention the CLI flag and env var
for forcing recreation (e.g. `--recreate-sandbox` /
`NEMOCLAW_RECREATE_SANDBOX=1`) so users know how to recreate the sandbox in
non-interactive mode.
- Around line 2020-2022: The prompt handling for sandbox reuse only treats "n"
as a negative; update the logic around the promptOrDefault call so answers of
"no" (case-insensitive, trimmed) are also treated as negative. Locate the
variable answer from promptOrDefault and change the conditional that currently
checks answer.trim().toLowerCase() !== "n" to instead reject both "n" and "no"
(for example by normalizing answer and checking it is not "n" and not "no" or by
using an inclusion check), so upsertMessagingProviders(messagingTokenDefs) is
only called when the user really chose to reuse the sandbox.

In `@test/onboard.test.js`:
- Around line 2215-2219: The test currently spreads process.env into the child
env when calling spawnSync (see the env: { ...process.env, HOME: tmpDir, PATH:
`${fakeBin}:${process.env.PATH || ""}` } block), which can inherit
NEMOCLAW_NON_INTERACTIVE or NEMOCLAW_RECREATE_SANDBOX and break interactive path
coverage; fix by constructing a fresh env object (copy needed keys or start from
Object.assign({}, process.env)), then explicitly delete
env.NEMOCLAW_NON_INTERACTIVE and env.NEMOCLAW_RECREATE_SANDBOX before using it
in the spawnSync invocation so the child process always runs the interactive
branch.
- Around line 2010-2056: The test currently only asserts exit code and hint,
which can miss destructive side-effects; update the test so runner.run throws
when called with a "sandbox delete" command (patch runner.run to throw on
delete), stub child_process.spawn (or spawnSync) used by createSandbox to fail
on any sandbox creation attempt, and ensure the env passed to the spawned script
removes NEMOCLAW_RECREATE_SANDBOX so the test exercises the non-recreate branch;
target the runner.run, runner.runCapture, createSandbox invocation and the
child_process.spawn stub to enforce immediate failure on destructive commands
and prevent masked deletions.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 1ec2f39c-1fad-4c94-b98d-ddca48485726

📥 Commits

Reviewing files that changed from the base of the PR and between 6ffc19b and d5cf62b.

📒 Files selected for processing (3)

bin/lib/onboard.js
bin/nemoclaw.js
test/onboard.test.js

bin/lib/onboard.js

test/onboard.test.js

paritoshd-nv · 2026-04-06T19:40:16Z

Unrelated change in isNonInteractive()

The added process.env.NEMOCLAW_NON_INTERACTIVE === "1" check is already handled at the top of onboard() where NON_INTERACTIVE is set. Could you revert this to keep the PR scoped to the sandbox prompt change?

Tests

Could you add unit tests for the new branching logic? Specifically the interactive ready/not-ready paths and the non-interactive error exit.

Hi @ericksoa , thanks for the review.

Added tests for the new branching logic. Please note that new tests add ~1min of time in overall tests.

In the last comment, I have mentioned about the need for : process.env.NEMOCLAW_NON_INTERACTIVE === "1"

ericksoa

Clean PR — the --recreate-sandbox flag, non-interactive error-on-not-ready, and interactive prompt are all good additions. Tests are thorough and follow existing patterns. One thing worth discussing:

Interactive mode doesn't guard on sandbox readiness

The original code only offered reuse when existingSandboxState === "ready". The new code drops that guard in the interactive path:

if (!isRecreateSandbox() && !needsProviderMigration) {
  if (isNonInteractive()) {
    if (existingSandboxState === "ready") { ... }  // ← guarded
    // not ready → exit(1)                          // ← guarded
  }
  // interactive: prompts for ANY existing sandbox, including not-ready
  console.log(`  Sandbox '${sandboxName}' already exists.`);
  const answer = await promptOrDefault("  Reuse existing sandbox? [Y/n]: ", ...);
  if (answer !== "n") {
    return sandboxName;  // ← returns a possibly non-ready sandbox
  }
}

Non-interactive handles this correctly (errors if not ready). But in interactive mode, a user could say "y" to reuse a sandbox stuck in NotReady/Creating — which would return without a working sandbox.

Question: Should the interactive prompt also check existingSandboxState === "ready" before offering reuse? Or at minimum show the current state so the user knows what they're agreeing to?

There's also no test for the interactive + not-ready edge case (the other three quadrants are covered). Adding one would close that gap.

paritoshd-nv · 2026-04-07T00:02:04Z

Question: Should the interactive prompt also check existingSandboxState === "ready" before offering reuse? Or at minimum show the current state so the user knows what they're agreeing to?

There's also no test for the interactive + not-ready edge case (the other three quadrants are covered). Adding one would close that gap.

That was miss from my part. Now, added the "interactive and not-ready sandbox" case with explicitly confirmation from user.
Added test case as well.

Add --recreate-sandbox CLI flag and interactive confirmation prompt when a sandbox with the same name already exists during onboarding. In non-interactive mode, ready sandboxes are reused with a hint about the new flag; non-ready sandboxes now exit with an error instead of silently recreating. Closes NVIDIA#1102 Signed-off-by: Paritosh Dixit <paritoshd@nvidia.com>

Guard the interactive path on sandbox readiness. When a sandbox exists but is not ready, prompt the user to confirm deletion and recreation instead of silently proceeding. Selecting n aborts onboarding. Add test for the interactive + not-ready edge case. Signed-off-by: Paritosh Dixit <paritoshd@nvidia.com>

paritoshd-nv · 2026-04-07T13:05:58Z

Hi @ericksoa,

Due to new PR merges in the main, there were merge conflicts in this PR today morning.
Resolved the merge conflicts and PR is ready to merge.

ericksoa

Looks good — the interactive readiness guard, "no" handling, and test coverage all address the earlier feedback. Approving.

One nit for a follow-up PR: the isNonInteractive() change (adding || process.env.NEMOCLAW_NON_INTERACTIVE === "1") is still here — that's already handled where NON_INTERACTIVE is initialized in onboard(), so it's redundant and out of scope for this PR. Same applies to the matching pattern in isRecreateSandbox(). Could you clean both up in a follow-up?

paritoshd-nv · 2026-04-07T17:46:05Z

The test spawns a subprocess with NEMOCLAW_NON_INTERACTIVE: "1" in the env. But the script calls createSandbox() directly — not onboard() — so NON_INTERACTIVE is never set from the env var. Without the fallback in isNonInteractive(), the code will take the interactive path and hit the prompt, hanging the test.

It was needed to run the tests (without this change tests were failing):
The test spawns a subprocess with NEMOCLAW_NON_INTERACTIVE: "1" in the env. But the script calls createSandbox() directly — not onboard() — so NON_INTERACTIVE is never set from the env var. Without the fallback in isNonInteractive(), the code will take the interactive path and hit the prompt, hanging the test.

When openshell sandbox create exits with SSH 255 after printing "Created sandbox:", NemoClaw previously hard-exited instead of checking whether the sandbox reached Ready state. This was a regression from the create-stream extraction (#1516) combined with the messaging provider migration path (#1081, #1527) that forces sandbox recreation. Two fixes: - streamSandboxCreate: do one final readyCheck on non-zero close to catch the race where the sandbox is already Ready when SSH dies. - onboard.js: when failure is sandbox_create_incomplete, fall through to the existing ready-wait loop (60s polling) instead of exiting. Signed-off-by: Aaron Erickson <aerickson@nvidia.com>

…1598) ## Summary - When `openshell sandbox create` exits with SSH 255 after printing "Created sandbox:", NemoClaw now treats this as recoverable instead of hard-exiting - Added a final `readyCheck` in `streamSandboxCreate` on non-zero close to catch the race where the sandbox is already Ready when SSH dies - In `onboard.js`, `sandbox_create_incomplete` failures now fall through to the existing 60s ready-wait loop instead of calling `process.exit()` ## Root cause Regression from #1516 (create-stream extraction) combined with #1081/#1527 (messaging provider migration forcing sandbox recreation). The create stream returns non-zero after "Created sandbox:" and NemoClaw exits before checking if the sandbox reached Ready state. ## Test plan - [x] New unit tests: stream recovers when readyCheck is true at close time (SSH 255) - [x] New unit test: stream still returns non-zero when readyCheck is false at close time - [x] All existing `sandbox-create-stream` tests pass (7/7) - [x] All onboard integration tests pass (83/83) - [x] All src unit tests pass (538/538) - [ ] Manual: trigger SSH 255 during sandbox create and verify recovery  ## Summary by CodeRabbit * **Bug Fixes** * Enhanced sandbox creation error handling with improved failure classification and recovery messaging. * Improved sandbox readiness detection to prevent unnecessary creation failures when the sandbox is operational.  Signed-off-by: Aaron Erickson <aerickson@nvidia.com>

paritoshd-nv requested a review from ericksoa April 6, 2026 15:05

coderabbitai bot reviewed Apr 6, 2026

View reviewed changes

paritoshd-nv mentioned this pull request Apr 6, 2026

[NemoClaw][spark] Re-onboard with existing sandbox name silently overwrites instead of rejecting #1102

Closed

paritoshd-nv force-pushed the fix/1102-recreate-sandbox-prompt branch from 058101f to 0e9bded Compare April 6, 2026 17:11

ericksoa reviewed Apr 6, 2026

View reviewed changes

paritoshd-nv force-pushed the fix/1102-recreate-sandbox-prompt branch from 6ffc19b to 79feed8 Compare April 6, 2026 18:15

coderabbitai bot reviewed Apr 6, 2026

View reviewed changes

bin/lib/onboard.js Show resolved Hide resolved

bin/lib/onboard.js Outdated Show resolved Hide resolved

test/onboard.test.js Show resolved Hide resolved

test/onboard.test.js Outdated Show resolved Hide resolved

paritoshd-nv force-pushed the fix/1102-recreate-sandbox-prompt branch from d5cf62b to ed401f4 Compare April 6, 2026 18:55

ericksoa reviewed Apr 6, 2026

View reviewed changes

paritoshd-nv force-pushed the fix/1102-recreate-sandbox-prompt branch 2 times, most recently from 6416db0 to 0f9eefa Compare April 6, 2026 23:46

paritoshd-nv added 2 commits April 7, 2026 12:26

paritoshd-nv force-pushed the fix/1102-recreate-sandbox-prompt branch from 0f9eefa to e134f9a Compare April 7, 2026 12:42

Merge branch 'main' into fix/1102-recreate-sandbox-prompt

54ee4c5

ericksoa approved these changes Apr 7, 2026

View reviewed changes

ericksoa merged commit 57a93dd into NVIDIA:main Apr 7, 2026
8 checks passed

ericksoa mentioned this pull request Apr 8, 2026

fix(onboard): recover from SSH 255 when sandbox was already created #1598

Merged

6 tasks

Conversation

paritoshd-nv commented Apr 6, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Related Issue

Changes

Type of Change

Testing

Checklist

General

Code Changes

Doc Changes

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Apr 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram

Estimated Code Review Effort

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

wscurran commented Apr 6, 2026

Uh oh!

ericksoa left a comment

Choose a reason for hiding this comment

Uh oh!

paritoshd-nv commented Apr 6, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

paritoshd-nv commented Apr 6, 2026

Uh oh!

ericksoa left a comment

Choose a reason for hiding this comment

Interactive mode doesn't guard on sandbox readiness

Uh oh!

paritoshd-nv commented Apr 7, 2026

Uh oh!

paritoshd-nv commented Apr 7, 2026

Uh oh!

ericksoa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

paritoshd-nv commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

paritoshd-nv commented Apr 6, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Apr 6, 2026 •

edited

Loading

paritoshd-nv commented Apr 7, 2026 •

edited

Loading