fix(connect): fail fast when gateway is down#3853
Conversation
Signed-off-by: Chengjie Wang <chengjiew@nvidia.com>
📝 WalkthroughWalkthroughWhen the OpenShell gateway becomes disconnected or unreachable, ChangesGateway Availability Detection and Recovery
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~22 minutes Suggested labels
Suggested reviewers
Poem
🚥 Pre-merge checks | ✅ 4 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches📝 Generate docstrings
🧪 Generate unit tests (beta)
Comment |
E2E Advisor RecommendationRequired E2E: Dispatch hint: Full advisor summaryE2E Recommendation AdvisorBase: Required E2E
Optional E2E
New E2E recommendations
Dispatch hint
|
PR Review AdvisorRecommendation: blocked This is an automated advisory review. A human maintainer must make the final merge decision. Limitations: Review used provided trusted metadata and read-only repository inspection; no tests, package-manager commands, scripts, or workflows were executed.; CI and E2E results were evaluated only from the provided GitHub/status context; several checks were still IN_PROGRESS.; No live OpenShell gateway or real sandbox was available to validate docker-kill behavior.; Issue and PR text were treated as untrusted evidence and only used for acceptance mapping. Full advisor summaryPR Review AdvisorBase: The connect behavior change is directionally correct and has a focused unit regression, but merge is blocked by pending CI/mergeStateStatus=BLOCKED, missing required sandbox E2E evidence for this head SHA, and monolith growth in connect.ts. Gate status
🔴 Blockers
🟡 Warnings
🔵 Suggestions
Acceptance coverage
Security review
Test / E2E status
✅ What looks good
Review completeness
|
Summary
nemoclaw <sandbox> connectreadiness polling when the named NemoClaw/OpenShell gateway is down or unreachablenemoclaw onboardProvisioningunknownreadiness status plus disconnected gateway lifecycleFixes #3821
Test Plan
npm run build:clinpm run typecheck:clinpm test -- test/cli.test.ts -t "fails fast with gateway recovery guidance"npm test -- test/sandbox-stuck-recovery.test.tsnpm test -- test/cli.test.ts -t "connect|gateway metadata exists|gateway is no longer configured|gateway recovery guidance|sandbox readiness"git diff --checkSummary by CodeRabbit
Bug Fixes
connectcommand now detects gateway unavailability immediately and fails fast with recovery guidance, instead of waiting for a timeout.Signed-off-by: Chengjie Wang chengjiew@nvidia.com