wip ci(smoke): enable agent isolation on smoke runs#525
Conversation
Set agent.isolation=true in tests/experiments/default.yaml and update the e2e smoke job to ubuntu-24.04 with bubblewrap+socat installed and AppArmor's unprivileged-userns restriction relaxed, so the SDK's bwrap sandbox actually engages instead of silently falling back to no-op. Mirrors UiPath/coder_eval#207. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
|
Claude finished @akshaylive's task in 1m 39s —— View job Review: PR #525 — ci(smoke): enable agent isolation on smoke runs
SummaryEnables filesystem-perimeter sandboxing ( Change-by-Change Review1.
|
| Area | Status | Notes |
|---|---|---|
| Frontmatter | N/A | No skill files changed |
| E2E Tests | N/A | Test config updated, not test tasks |
| Skill Body | N/A | No skill files changed |
| References & Assets | N/A | No skill files changed |
| Repo Hygiene | OK | Scoped to CI + test config, no secrets, no cross-skill refs |
Issues for Manual Review
- Confirm the upstream
UiPath/coder_eval#207change is merged/available so the SDK actually honorsisolation: truewhen it receives it. - Per the PR's test plan: spot-check one task's
sdk_options.json(ortask.json) in a CI run to confirmsandbox.enabled=truewas sent to the SDK.
Conclusion
Clean, well-documented CI change. The runner pin and sandbox dependency installation are correctly implemented with appropriate fallbacks. No issues found — approve.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
f0e938a to
cf23341
Compare
|
After debating back and forth, while this change works, it's very brittle. Closing this with no-op. |
Summary
Mirrors UiPath/coder_eval#207.
Test plan
🤖 Generated with Claude Code